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Preface 


Starting from the development of transistor technology to laser technology, the field 
of solid state devices and their circuit applications has advanced rapidly. The silicon 
bipolar junction transistor was first applied to low frequency circuits. The subsequent 
advances in materials science made it possible to fabricate compound semiconductor 
transistors capable of operating at microwave frequencies and high speeds. This pre- 
sented the capability of applications in both analogue and digital circuits. At the same 
time, the wide choice of high performance semiconductor materials also enabled the 
development of optoelectronic devices such as lasers and light-emitting diodes. The 
communications industry continues to grow and diversify, thus necessitating the design 
of circuits which will satisfy the requirements of mobile telephones which are becom- 
ing more and more sophisticated in their performance. Circuit design has applications 
in other areas such as optical communications. 

This book focusses on high-speed electronics and optoelectronics where the devices 
operate at frequencies >1 GHz. It is presented in two parts with devices being dis- 
cussed in the first part and the circuit applications in the second part. In Part One, 
semiconductor devices fabricated in a variety of material systems — Si, III]—V compound 
semiconductors and SiGe — are presented. We discuss the concepts and the fundamen- 
tal principles of operation. We do not attempt to present the latest results as they will 
already be obsolete by the time the book is published. It is assumed that the reader has 
had a course in fundamental solid state physics. 

Chapter 1 reviews semiconductor materials and physics. For the reader who is famil- 
iar with the topics, this chapter will be a brief review. If not, the reader can go to the 
references section to get a detailed coverage of the topics. Semiconductor materials are 
described followed by brief discussions of crystal structure and bonding. The section 
on quantum mechanics is intended to present only the important concepts and is not 
a comprehensive treatment of the subject. Semiconductor properties are described fol- 
lowed by types of semiconductors. Semiconductor junctions are treated in detail as they 
are the basis of the devices to be treated in subsequent chapters. 

Chapter 2 presents high-frequency/high-speed electronic devices starting with the 
MESFET, which was the first transistor to operate at microwave frequencies. The devel- 
opment of the high electron mobility transistor (HEMT) represented a major advance in 
technology and is presented here in detail. The recent application of MOSFETs to radio 
frequency has been successful and the properties are covered in detail. Finally, bipolar 
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and heterojunction bipolar transistors (HBTs) are described. Models for the transistors 
are presented and their method of implementation is described. 

Chapter 3 presents the optimisation and parameter extraction of the circuit models of 
the electronic devices. The simulated annealing algorithm is discussed followed by the 
application of neural networks to circuit modelling. The genetic algorithm is defined 
and its application to optimisation is shown. Parameter extraction methods are given for 
circuit models using semi-analytical methods and basic expressions are derived. 

Chapter 4 deals with various optical sources such as light-emitting diodes and lasers, 
giving details of their physical properties and their modes of operation. The discussion 
of emitters is followed by an extensive coverage of a variety of photodetectors. 

In Part Two of the book, we discuss analogue circuits at the gate level. We will 
assume that the reader has a background (at the undergraduate level) in fundamental 
analogue circuit theory. Chapter 5 (Part Two of the book) deals with the components of 
high-speed analogue circuits. After a review of scattering parameter theory, the power 
and noise relations for two-port networks are discussed. Transistor amplifiers are cov- 
ered in detail, showing the application of the devices described in Chapter 2. This is 
followed by a discussion of oscillators and mixers for high-speed circuits. Important 
passive components of high-speed circuits complete this chapter. 

We have a layered approach to each chapter in the book. There is an executive 
summary at the beginning of each chapter. This will make the book valuable also for 
technical managers who may not want to go through the chapter content in detail. We 
have extensive problems at the end of each chapter, which will give the student appli- 
cations of the theory. This book should be useful to research engineers and graduate 
students. Results from various research papers are presented, many of which are only 
available in journals which are referenced extensively. However, the reader need not 
go to the original papers as the results are given in sufficient detail to give a good 
understanding of the material. 
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1.1 


Review of semiconductor materials 
and physics 


Executive summary 


Semiconductor devices are fabricated using specific materials that offer the desired 
physical properties. There are three classes of solid state materials: insulators, semi- 
conductors and conductors. This distinction is based on the electrical conductivity of 
these materials with insulators having the lowest and conductors having the highest con- 
ductivity. Semiconductors fall in between and their conductivity is affected by several 
factors such as temperature, the incidence of light, the application of a magnetic field 
and impurities. This versatility makes semiconductors very important in electronics and 
optoelectronics applications. 

Semiconductors themselves are divided into two classes: elemental and compound. 
Each type has distinctive physical properties which are exploited in device design. Typ- 
ical elemental semiconductor device materials are silicon and germanium; examples of 
compound semiconductors are GaAs, InP, AlGaAs and SiGe. The single crystal struc- 
ture of these materials is that of a periodic lattice and this determines the properties 
of the semiconductors. Silicon has the diamond crystal structure and the compound 
semiconductors have the zincblende lattice structure. The bonding between atoms in a 
crystal of the semiconductors is termed covalent bonding, where electrons are shared 
between atoms. Fundamental principles of quantum mechanics are applied to determine 
the energy band structure of the semiconductor. 

The basic device physics involves the description of the energy band structure, the 
density of states, the carrier concentration and the definition of donors and acceptors. 
Semiconductors are categorised as direct or indirect depending on the bandgap. The 
absorption mechanism is described and radiation and recombination processes impor- 
tant to device performance are detailed. The two carrier transport processes are drift 
and diffusion. The currents due to these transport processes are expressed in terms of 
the applied electric field, the carrier mobility and the carrier concentration. The junction 
formed by p-type semiconductor (excess holes) and n-type semiconductor (excess elec- 
trons) is described and the characteristics of such a junction are given. The important 
Schottky diode, a junction formed by a metal and a semiconductor layer (n-doped in 
this case) is characterised. 

Heterostructures formed by dissimilar semiconductors are important in device design. 
The properties of heterojunctions of semiconductor materials are presented. Silicon— 
germanium heterojunctions are of particular interest as high performance electronic 
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Table 1.1 Portion of the periodic table showing semiconductor 
material elements 


Period Group III Group IV Group V 


2 B Cc N 
Boron Carbon Nitrogen 
3 Al Si P 
Aluminium Silicon Phosphorus 
4 Ga Ge As 
Gallium Germanium Arsenic 
5 In Sn Sb 
Indium Tin Antimony 


Table 1.2 Elemental and binary compound semiconductors 


Elements IV-IV Binary compounds _ [III-V Binary compounds 
Si Silicon SiC Silicon carbide AlAs Aluminium arsenide 
Ge Germanium SiGe Silicon germanium AIP Aluminium phosphide 


AlSb Aluminium antimonide 
BN Boron nitride 

GaAs Gallium arsenide 

GaN Gallium nitride 

GaSb Gallium antimonide 
InAs Indium arsenide 

InP Indium phosphide 

InSb Indium antimonide 


Table 1.3 Ternary and quaternary semiconductors 


Ternary compounds Quaternary compounds 
AlxGa,_xAs AlxGa,_,AsySbj_y 

Aluminium gallium arsenide Aluminium gallium arsenic antimonide 
GaAs j_xPx GaxInj_xAsj_—yPy 

Gallium arsenic phosphide Gallium indium arsenic phosphide 


devices have been designed using this material alloy. This chapter gives a detailed 
discussion of these heterojunctions. 


Semiconductor materials 


Materials used for semiconductors fall into two categories: elemental semiconductors 
and compound semiconductors. Table |.1 shows the section in the periodic table which 
has the semiconductor elements and Table |.2 lists examples for elemental and binary 
compound semiconductors. Some ternary and quaternary semiconductors are listed in 
Table 1.3. 
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Fig. 1.1 


Fig. 1.2 
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Types of solids 


There are three types of solids: crystalline, polycrystalline and amorphous. The arrange- 
ment of atoms is periodic in three dimensions in a crystalline solid with forces binding 
the atoms together. This periodicity exists over the entire crystal and it will appear the 
same regardless of the region where the crystal is viewed. If the periodicity of the atoms 
occurs over a small region of the solid and changes in different regions of the solid, the 
solid is termed to be polycrystalline. Atoms in amorphous solids exhibit no periodicity. 
Figure |.1 shows the three different types of solids. 


Crystal structure 


Semiconductor materials such as Si, Ge and GaAs that are to be used for devices are 
crystalline, that is, a single crystal. This periodic arrangement of atoms in a crystal is 
termed a lattice and the distance between the atoms is the lattice constant. The unit 
cell is a fundamental unit in the crystal and a repetition of the unit cell generates the 
entire lattice. The unit cell is not unique and can be chosen in various ways as shown 
in Figure |.2(a). This is a two-dimensional representation of the crystal lattice. The 
entire lattice can be constructed by translations of any of the three unit cells in two 
coordinate directions. The primitive unit cell is the smallest unit cell. A generalised 
primitive three-dimensional unit cell is shown in Figure |.2(b). The coordinate direc- 
tions are a,b,c. In cubic structures, these would be the rectangular coordinates. The 
basic cubic crystal structures are (a) the simple cubic, (b) the body-centred cubic and 


Pe e e 
oo eee Se. . 
oo —~ e e e ia 


(a) Crystalline (b) Polycrystalline (c) Amorphous 


Schematic arrangement of atoms in solids. 


ion 


a 


(a) Two-dimensional lattice — (b) Generalised primitive unit cell 
shaded areas show possible unit cells 


Unit cells. 
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(a) Simple cubic (b) Body-centred cubic —(c) Face-centred cubic 


Types of cubic lattices. 


(a) Diamond lattice and (b) Zincblende lattice. (S. M. Sze, Semiconductor Devices: Physics and 
Technology, John Wiley & Sons, 1985). Reprinted with permission of John Wiley & Sons, Inc. 


(c) the face-centred cubic shown in Figure |.3. The simple cubic lattice has an atom 
at each corner of the cube, where the length of a side of the cube is a, the lattice con- 
stant. The body-centred cubic lattice (BCC) has an additional atom in the centre of the 
cube and the face-centred cubic lattice (FCC) has an additional atom in the centre of 
each face of the cube. The two most important semiconductor crystal structures are the 
diamond lattice structure and the zincblende structure. Silicon and germanium have the 
diamond lattice structure and most of the binary compound semiconductors such as 
GaAs have the zincblende lattice structure. The only difference between the diamond 
and the zincblende structures is that the latter has two different types of atoms as seen 
in Figure 1.4. The diamond structure consists of two inter-penetrating FCC sublattices 
of atoms. The second FCC cube is shifted by one-fourth of the body diagonal, which 
is the longest diagonal. In the zincblende structure of GaAs, one sublattice has gallium 
atoms and the other has arsenic atoms. 


Crystal directions and planes 


Crystals are of finite size and hence have surfaces. It is necessary to define the planes at 
the crystal surfaces and the crystallographic directions, both of which determine the 
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x 


Representation of plane with Miller indices [6, 5, 8]. 


x 


Representation of direction with Miller indices [6, 5, 8]. 


properties of semiconductor devices. The rectangular coordinate system defines the 
cubic crystal and the plane surfaces and directions are described by a set of indices 
called the Miller indices. Planes are described by the indices (h,k,l) and the directions 
perpendicular to these planes are described by the same indices [hk]]. 


Example: Find the Miller indices of the plane which makes intercepts 3a,4a,2a along 
the coordinate axes in a cubic crystal, where a is the lattice constant. Draw the direction 
vector with the same Miller indices. 


Solution: The intercepts are 3, 4 and 2. The reciprocals are 1/3, 1/4 and 1/2. Multipli- 
cation by the lowest common denominator, which is 12, yields (4,3,6). These are the 
Miller indices which define the plane shown in Figure 1.5. It can be shown that parallel 
planes are described by the same Miller indices. 

The Miller indices of the direction are given as [436]. The intercepts on the three coor- 
dinate axes are 3, 4 and 2. The direction vector is drawn and seen to be perpendicular to 
the planes shown in Figure 1.6. 


The basic planes in cubic crystals are shown in Figure 1.7. It is also important to 
describe specific directions in a crystal in addition to the planes. As in the case of the 
crystal plane, a crystal direction is also described by three integers which are the com- 
ponents of a vector drawn in the particular crystal direction. The crystal planes and 
directions of most interest are shown in Figure |.8. The [hk]] direction is perpendicular 
to the (hkl) plane. 
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Basic crystal planes. 
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(a) (100) plane (b) (110) plane (c) (111) plane 


[100] direction [110] direction [111] direction 


Important crystal planes and directions. 


Atomic bonding 


Atoms are held together by bonding forces to form solids. When the attractive and 
repulsive forces are equivalent, the atoms are in equilibrium and maintain the spacing 
characterised by the lattice constant, a. There are different bonding classifications which 
are described by the dominant force of attraction. When one of the atoms gives up an 
electron in the outer shell to another atom, positive and negative ions are produced. 
There is a Coulomb interaction force of attraction between them. This is termed ionic 
bonding. At equilibrium, the forces of attraction and repulsion are equivalent. Sodium 
chloride (NaCl) and Potassium chloride (KCl) are examples of ionic bonding after the 
formation of the Nat and Cl7 ions. 


Covalent bonding 


This type of bonding results when electrons are shared by neighbouring atoms. The 
hydrogen atom is the simplest example of covalent bonding. Each of the two electrons 
bonds with the other to complete the lowest energy shell as shown in Figure 1.9. 

Each atom in a diamond or zincblende lattice has four nearest neighbours. Each atom 
has four electrons in the outer orbit. These are the valence electrons and each atom 
shares these valence electrons with its four neighbours. The interaction between the 
shared electrons results in bonding forces which are quantum mechanical in nature. In 
other words, each electron pair constitutes a covalent bond. Elements in group IV such 
as Si and Ge have four valence electrons as shown in the references [8, 14, 15]. These 
are available for bonding as seen in Figure 1.10. Compound semiconductors such as 
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(a) Valence electrons (b) Covalent bonding 


Covalent bonding in hydrogen. 


(a) Silicon atoms with four valence electrons (b) Covalent bonding 


Covalent bonding in silicon. 


GaAs exhibit both covalent as well as ionic bonding. This is due to the fact that Ga and 
As occur in two different groups in the periodic table and hence there is a transfer of 
charge resulting in some ionic bonding. 


Atomic physics 


The theories of atomic physics were based on experimental observations. These theories 
subsequently explained the experiments and led to the understanding of atoms in matter. 


The photoelectric effect 


The measurements of Planck on a heated sample of material indicated that energy is 
radiated in discrete units called quanta as shown in Equation (1.1). 


E=h, (1.1) 


where h (Planck’s constant) = 6.63 x 10774 J - s and v is the frequency of the radiation. 
Heinrich Hertz discovered the photoelectric effect in 1887. The experiments performed 
by Philipp Lenard, a former student of Hertz, showed that if light shines on a metal 
surface in vacuum, some of the electrons receive enough energy so that they are emit- 
ted from the surface into the vacuum. They were interpreted by Albert Einstein, who 
received the Nobel Prize for his work in 1921. This is termed the photoelectric effect and 
the maximum energy is a function of the frequency of the incident light. The quantised 
units of light energy are called photons. 
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Based on further experimental observations of Davisson and Germer (USA) and 
Thompson (UK) on the diffraction of electrons by the atoms in a crystal, de Broglie 
related the wavelength of a particle of momentum p = mv, where m is the mass of the 
particle as seen in Equation (1.2): 


a een (1.2) 


The Bohr model of the atom 


A model of the atom was first proposed by Bohr. In his model, the electrons move in 
stable circular orbits about the nucleus and the electron may move to an orbit of higher 
or lower energy. The electron would either gain energy or lose energy by the absorption 
or emission of a photon of energy Av. Bohr further proposed that the angular momentum 
of the electron moving in a circular orbit was an integral multiple of Planck’s constant 
as seen in Equation (1.3). 


nh 
Py = =— =nh, n=1,2,3.,... (1.3) 
20 
The hydrogen atom with one electron and the nucleus illustrates this concept in a simple 
manner as seen in Figure 1.11. 
Assuming that the electron of mass m rotates in a stable orbit of radius r with velocity 
v, the angular momentum is written in Equation (1.4): 


Pp, = mor =nh. (1.4) 


The electrostatic force between the charge on the nucleus and the charge on the electron 
must be equal to the centripetal force for the electron to remain in stable orbits. This 
yields the expression in Equation (1.5) for the energy of the electron [15]: 

mq* 


|e en ae 1.5 
" 2(4r €9)2n2h2 ne 


Bohr model of the hydrogen atom. 
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Electron orbits in Bohr model (not to scale). 


The electron orbits in the Bohr model are shown in Figure 1.12. 


The de Broglie relation 


The initial theoretical and experimental results of Planck, Einstein and Bohr laid the 
foundation for the development of quantum mechanics. It was de Broglie, however, 
who first postulated that if waves were seen to behave as particles then it could be that 
particles might behave like waves. 

In the Bohr formulation, the electron which travels in a circular orbit of radius r is 
assumed to behave like a wave with a wavelength A. It travels in a circular path equal 
in length to the circumference 2777, which will be an integral number of wavelengths so 
that 


nk = 27r. (1.6) 


The Bohr formulation yielded the linear velocity of the electron to be 


2 

q 
— . 7 
Ar eonh oe 


Using this velocity relation, the wavelength can be written as 


h=—=-, (1.8) 
mv p 


where p is the linear momentum of the electron. Thus, de Broglie postulated that the 
relationship between the wavelength and the linear momentum p of a particle is given 
by Equation (1.2). 


p= -=—— =hk. (1.9) 
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This is the de Broglie relationship. For free electrons, the energy-momentum relation- 
ship is as follows: 


2 
uv 
Pe ok! open) one, (1.10) 
2 2m 
Hence, the experiments of Davisson and Germer and of Thompson were verified by the 
de Broglie relationship. 


Quantum mechanics 


Newtonian mechanics can be used to describe physical behaviour that is macroscopic. 
Typical examples of this are planetary motion, the classical electromagnetic fields 
and fluid motion. The motion of electrons and the interaction of electrons in atoms 
in semiconductor materials cannot, however, be described thus since we are dealing 
with microscopic behaviour. This physical behaviour on the atomic scale can only 
be described by quantum mechanics rather than Newtonian mechanics. Quantum or 
wave mechanics had as its basis the physical understanding developed by Planck and 
de Broglie. The classical laws of the conservation of energy, momentum and angular 
momentum are also assumed to be valid in quantum mechanics. Hence, the physics 
involved in the interaction between atoms can be described mathematically by quantum 
mechanics. 


Probability and the uncertainty principle 


When the motion of the particle is microscopic, the parameters cannot be described 
exactly but rather in terms of average (expectation) values. Hence we have, for example, 
the expectation values of position, momentum and energy of an electron. So, we have a 
probabilistic rather than an exact description of the particle behaviour. There is, thus, an 
inherent uncertainty in the position and momentum of the particle. This was formulated 
by Heisenberg and is termed the Heisenberg uncertainty principle. The uncertainty in 
the measurement of the position and momentum of particle motion is given as 


(Ax)(Apx) 2 h. (1.11) 


The uncertainty in energy is related to the time at which the energy was measured and 
is given by 
(AE)(At) > fi. (1.12) 


These equations show that the simultaneous measurements of position and momen- 
tum on the one hand and energy and time on the other hand cannot be performed with 
arbitrary accuracy. 

It follows that we can only determine the probability of finding an electron in a certain 
position or having a certain momentum. This leads to the definition of a probability 
density function. The probability of finding a particle in a range, say, from x to x + dx 
is given by 
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Table 1.4 Classical variables and quantum operators 


Classical variable Quantum operator 
x x 
f(x) f(x) 
Momentum p(x) 4 2 
bocce 2 A292 
Kinetic energy — Tm o, 
Potential energy V Vv 
Total energy E = o 
CO 
i P(x)dx = 1, (1.13) 
=0O 


where P(x) is a normalised function. The average value of a function x is defined as 


(f (x)) -|/ S(x)P(x)dx = 1. (1.14) 


The correspondence between classical and quantum mechanical quantities is shown in 
Table 1.4. 

The basic principles of quantum mechanics will now be reviewed. Each particle in 
a physical system is described by a wave function W(x, y, z,t). The function and its 
space derivatives are continuous, finite and single-valued. 

The probability of finding a particle with wave function W in the volume dxdydz is 
W* Udxdydz. Then we have the following definition for three-dimensional space: 


00 
/ W* Wdxdydz = 1. (1.15) 
—0o 


The expectation value of any physical quantity X can be written as 
[o,@) 
<X>= iy W* X oper Vdxdydz, (1.16) 
—0o 


where X oper is the operator corresponding to the variable X. 
The classical equation for energy conservation is Kinetic energy + Potential energy = 
Total energy: 


FY (1.17) 
2m 


The wave equation 


We obtain the quantum mechanical energy equation by substituting the corresponding 
operators which operate on the one-dimensional wave function WV (x, f): 
—h? a? W(x, 1) —h aW(x, t) 


+ V(x) (x,t) = EV (x, t) = — 


— —_.— ——_—. 1.1 
2m ax? J ot a) 
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Infinite potential well, width = 2a. 


This is the one-dimensional Schrédinger wave equation. The three-dimensional wave 
equation is 


-h? _, —h aw 
— VU + Vix)W = EV = —_——_., (1.19) 
2m J 


The wave equation is applied to the solution of various physical problems. The problem 
of the infinite potential well provides an understanding of the method of solution and an 
insight into the discrete energies of a single electron [14, 15]. 

This basic physical concept is important since quantum wells can be fabricated using 
semiconductor structures for devices. A general solution of the one-dimensional wave 
equation can be written as follows: 


—jEt 
W(x, t) = W(x) exp (—*). (1.20) 


We consider the infinite quantum well of width 2a with zero potential outside the well 
as shown in Figure 1.13. 

On solving the one-dimensional wave function, we obtain solutions and the discrete 
energy levels are given by [14, 15], 


wh? n? 
Ey, = ——>; E21 
n 8moaz ( ) 


where mo is the rest mass of the electron and a is the lattice constant of the crystal. 
The one-dimensional problem of a particle in a finite potential well can also be solved 
and the allowed energies of the particle determined [10]. The phenomenon of tunnelling 
wherein an electron with energy E tunnels through a potential barrier with barrier height 
Vo greater than E is also explained by quantum mechanics. Classically, the electron 
would not be able to show this behaviour. If we have a potential barrier of width a, the 
one-dimensional Schrédinger equation can be solved in the three regions I, II and III as 
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Vo Vo 


Potential barrier. 


shown in Figure 1.14. There are three regions for the problem. Regions I and III have 
zero potential. Say region II has a potential Vo, then the solutions in the three regions 
are given by: 


: : . 2  2mE 
Region I: w(x) = Aexp(jkx) + B exp(—jkx); k° = 2 (1.22) 
. 2__ 2m(Vo — E) 
Region IT: w(x) = C exp(—ax) + Dexp(+ax); a* = — >_> (1.23) 
. : >  2mE 
Region IT: w(x) = F exp(jkx); k* = “eo (1.24) 


Using the conditions that the wave function and its derivatives are continuous at the 
boundaries, x = 0 and x = a, the tunnelling probability is of the form: 


aa 4 
Acosh?(ad) + (% — £)” sinh2(wd) 


a 


(1.25) 


r= 


Boundary conditions are matched at the two boundaries and T, the tunnelling probabil- 
ity is determined. 

The method of solution is the same regardless of the shape of the barrier. Triangu- 
lar and trapezoidal barriers have a simple geometry and hence give us exact solutions. 
When the barriers are of arbitrary shape, the tunnelling probability is solved using the 
Wentzel—Kramers—Brillouin (WKB) approximation: 


dy 
T = exp |-2 [ | f(x) | as] (1.26) 
d 


1 


with 
2mo 
ff) = wef — E}, (1.27) 
where V(x) is the arbitrary potential. The limits of the integral d; to dy represent the 


classically forbidden region, where the potential energy is larger than the total particle 
energy. 
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Statistical mechanics 


The free electron 


When the three-dimensional Schrédinger equation is solved, the general solution gives 
the wave function for the electron in motion in a region of zero potential. The 
behaviour of electrons in semiconductor crystals can be assumed to be like that of so- 
called free electrons under certain conditions, hence the importance of this result. The 
time-independent wave function solution is given by 


y(r) = Aexp(k-r), (1.28) 


where A is a complex quantity and is the amplitude, k is the wave vector and r 
is the three-dimensional space vector. This results in energies of the same form as 
Equation (1.21). 


Fermi-—Dirac distribution 


The Fermi—Dirac distribution function f (£) gives the probability that states with energy 
E are occupied by particles [10]: 


f(A) = (1.29) 


1 + exp (4a) ? 


Ef represents the Fermi energy where f(E) becomes equal to 1/2. 


Electrons in a semiconductor 


Since semiconductors have periodic lattice structures, the electrons are subjected to 
a periodic potential. Hence the Schrédinger equation must be solved for a periodic 
potential [10]. The Bloch theorem states that the one-dimensional wave function for 
an electron in a periodic potential is given by 


W(x) = Vax) exp(ykx), (1.30) 


where Vx(x) is a periodic potential with the same periodicity as the semiconductor 
crystal with lattice constant a such that 


Vi(x) = V(x + na), (1.31) 


where 7 is an integer. 


The Kronig—Penney model 


An important model for the band structure is the Kronig—Penney model (Figure 1.15). 
The one-dimensional periodic potential is given by 


V(x) =0, 0O< x <a (1.32) 
V(x) =Vo, —-b<x <0 (1.33) 
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Periodic potential for Kronig—Penney model. 


The periodicity distance is d = a + b. The wave equation is solved in the three regions 
and the continuity conditions for the wave function and its derivatives are applied. The 
non-trivial solutions are obtained when the electron energy is less than and greater than 
the potential Vo [10, 14]. The transcendental equation to be solved is 


2 2 

—6 
coskxd = cosaacosh bé — 2 sinaa@ sinh bé, 0 < E < Vo (1.34) 

a2 + §2 
coskxd = cosaa cos bé — sinaa sinbé, E > Vo (1.35) 
with 
2moE 2mo(E — Vo) 2mo(Vo — E) 

a , B=.|—_— ., 6 = .| —_—.. 1.36 
Pat.» Pe FB aay 


The solution of the equation gives the energy E. The allowed energy bands are sepa- 
rated by band gaps with no allowed energies. It follows that there are forbidden energy 
regions for an electron which is subjected to a periodic potential in a semiconductor 
crystal. 


Effective mass 


When the centre of mass of a classical particle moves with a velocity v, we define 
a phase velocity. If we have a packet of travelling waves with a centre frequency 
q@ and a wavenumber k, we have the classical dispersion relation for the group 
velocity: 

_ dw 


=F: (1.37) 


Ug 


In the quantum mechanical formulation, the wavepacket is the analogue of the classical 
particle in a given region of space. This wavepacket consists of constant-energy wave 
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function solutions and a centre energy is defined. Hence the wavepacket group velocity 
in the quantum-mechanical formulation can be written as in Equation (1.20): 


1ldE (1.38) 
vg = -—. : 

* A dk 

Using the force-momentum relations, we define the effective mass of an electron in a 

crystal as 
-1 

1 aE 

*= | : 1.39 


Section 1.14.1 defines heavy and light holes corresponding to wide and narrow bands 
respectively. 


Carriers in semiconductors 


The two types of carriers in semiconductors are the conduction band electrons and the 
valence band holes. The electrons occupy the conduction band when the temperature is 
raised above 0 K. The unoccupied states in the valence band are holes and are defined 
to have a positive charge with the same magnitude as the electronic charge. Hence, we 
consider electrons in determining the conduction band properties and holes in deter- 
mining the valence band properties. The band structures of several semiconductors are 
given by Pierret, and Streetman and Banerjee [10, 15] and others. 


Semiconductors in equilibrium 


Intrinsic semiconductors 


A semiconductor is described as being intrinsic when there are no impurities and no 
defects in the crystal. The concentration of electrons in the conduction band is equal 
to the concentration of holes in the valence band. At 0K, the electrons occupy all the 
available energy states in the valence band and all the states in the conduction band 
are empty. This follows from the fact that at 0 K, each electron is in the lowest possi- 
ble energy state. As the temperature is increased the electrons are excited due to the 
acquired thermal energy and move into the conduction band leaving behind holes in the 
valence band. Therefore, the equilibrium concentration of electrons in the conduction 
band no is equal to the equilibrium concentration of holes in the valence band po in 
intrinsic semiconductors [2, 15]: 


no = Po = Ni, (1.40) 


where nj is simply referred to as the intrinsic concentration of holes and electrons. 


Extrinsic semiconductors 


When impurity atoms are added to the intrinsic semiconductor such that the electron 
concentration is no longer equal to the hole concentration, it becomes an extrinsic 
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Band diagram. 


semiconductor and ng ~ po. Thus the doping of a semiconductor with impurities can 
produce excess electrons or holes. These atoms can be either donors or acceptors. If the 
dopant produces an excess of electrons, the dopant is referred to as a donor, the semi- 
conductor becomes n-type material with n > p and the current is predominantly due to 
the negatively charged electrons. If, on the other hand, the dopant generates holes, the 
dopant is referred to as an acceptor, the result is a p-type semiconductor with p > n 
and the current is predominantly due to the positively charged holes. Note that the hole 
charge has the same magnitude as the electronic charge [2, 8, 15]. 


Semiconductor band diagrams 


The band diagrams for p- and n-type semiconductors at thermal equilibrium are given in 
Figure |.16. The bottom of the conduction band is E¢, the top of the valence band is Ey, 
the intrinsic energy level is at mid-band and is denoted by £; and the Fermi level is Ep. 


Electron and hole distribution 


The distribution of electrons in the conduction band and holes in the valence band is 
obtained using the Fermi—Dirac probability function. The electron distribution in the 
conduction band is written as 


n(E) = gc(E) f(E), (1.41) 


where g,(£) is the density of quantum states in the conduction band and f(£) is the 
Fermi-—Dirac probability function given in Equation (1.29). The hole distribution in the 
valence band can be written in a similar way: 


P(E) = gv(E)U — f(E)I. (1.42) 


The density of states functions are written as 


* /Im*(E—E 
iy oma ) E> E, (1.43) 


&c(E) = re 
ms [2m (Ev — E) 
gv(E) = 1, Es Ey. (1.44) 


The equilibrium concentration of electrons can now be written as 


no= | n(E)dE, (1.45) 
E. 
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where n(E£) is given by Equation (1.41). Similarly, the equilibrium hole concentration 
is written as 


Ey 
po= | p(E)dE, (1.46) 


where p(£) is given by Equation (1.42). The equilibrium electron and hole concentra- 
tions in the conduction and valence bands respectively are written as 


-~(E, — E 

(oa N. exp ( ( iT 2) (1.47) 
nee 

po = Ny exp (4) ; (1.48) 


where N, and Ny are the effective density of states functions in the conduction and 
valence bands respectively. 


2nm*kT \>/* 
Ne =2 a (1.49) 
2nm*kT a 


The intrinsic carrier concentration nj; is given by 
2 _ 
n; = NnoPpo- (1.51) 
By substitution of Equations (1.47) and (1.48), we can write the intrinsic concentra- 
tion as 


(1.52) 


—(E,-— E 
ne = NeNy exp (“5 ™) 


kT 


=. 1.53 
iT (1.53) 


= N,N, exp 


where Eg is the bandgap energy. 


Direct and indirect semiconductors 


When light illuminates a semiconductor, and the photon energy is equal to or larger than 
the band gap, the light is absorbed, and creates hole—electron pairs. These holes and 
electrons are equal in number to maintain charge neutrality, and since they are not in 
equilibrium, in due course they recombine; this recombination may be radiative or non- 
radiative. Radiative recombination, when a photon is emitted usually at the bandgap 
energy, only occurs in direct bandgap material, whereas non-radiative recombination 
may occur in both direct and indirect bandgap semiconductors. In indirect semicon- 
ductors, this non-radiative recombination requires a phonon to mediate the process. 
Non-radiative processes in direct bandgap material are usually through traps or due 
to surface recombination. The direct or indirect band gap defines whether the lowest 
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position of the conduction band aligns with the maximum of the valence band along 
momentum space, where the effective momentum value k is equal to zero. 

Direct bandgap semiconductors are capable of photon emission, by radiative recom- 
bination, but indirect semiconductors have a low probability of radiative recombination. 
However, indirect bandgap semiconductors may have isoelectronic impurity states 
which are direct, and therefore the recombination from these states may also be radia- 
tive. GaP, which is an indirect gap semiconductor, may be doped with zinc oxide or 
nitrogen to produce these states, and the widely used green or red light-emitting diodes 
are examples of this emission. 


Absorption processes 


Considering Figure 1.17, we note that three different valence bands are shown. Equa- 
tion (1.39) had linked the carrier mobility the second derivative of energy with respect 
to k. It is then easy to understand why the band with less curvature at k = 0 (Vj) is 
called the “heavy hole band”, while the one which is more strongly bent (V2) is called 
“light hole band”. In most bulk semiconductors, the light and heavy hole bands con- 
cide at k = 0 — they are degenerate. The band V3 is called the “split-off band”. Note 


Vo V3: 
L T x 

T 20 T 2a 
ql111] k[000] = [100] 31111] k[000] = [1001 


Band diagram of bulk Ge (left) and Si (right). Note that the minimum of the conduction band is 
not aligned with maximum of the valence band at k = 0, indicating that these are indirect 
semiconductors. S. Wang, Fundamentals of Semiconductor Theory and Device Physics, 1st 
Edition, pp. 233-234, ©1989. Reprinted by permission of Pearson Education, Inc., Upper 
Saddle River, NJ. 
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Band diagram of bulk GaAs. Note that the minimum of the conduction band is aligned with 
maximum of the valence band at k = 0, indicating that this is a direct gap semiconductor. 

S. Wang, Fundamentals of Semiconductor Theory and Device Physics, \st Edition, pp. 233-234, 
©1989. Reprinted by permission of Pearson Education Inc., Upper Saddle River, NJ. 


that in each of these cases, the minimum of the conduction band is not aligned with 
the maximum of the valence band where k = 0, which implies that these are indirect 
semiconductors. In Figure 1.18, the band diagram of the direct semiconductor GaAs is 
shown; here the conduction band minimum is along the k = 0 axis and aligned with the 
maximum of the valence band. Note the degenerate valence band of heavy holes and 
light holes and the split-off band. 

Photo-excitation of semiconductors with photons energies equal to or greater than 
the bandgap energy of the material results in absorption, which in turn causes the 
creation of hole—-electron pairs for each photon. The major source of absorption in 
semiconductors is the valence band to conduction band transition. In the case of direct 
semiconductors, the transition occurs when the photon energy is at the bandgap value 
or larger and results in the transition of an electron in the valence band to the conduc- 
tion band. In indirect semiconductors, the absorption has to be mediated by phonons. In 
addition to the band to band absorption, transitions take place from acceptor to donor 
levels, from acceptor to conduction band, valence to donor level, all of which result 
in absorption below the bandgap energy. Figure 1.19 shows schematically the band 
to band transition for the direct gap semiconductor, and in Figure 1.20, the phonon- 
mediated transition. The conduction band to valence band and impurity band to impurity 
band transitions are shown schematically in Figure 1.21. Not shown in this figure are 
the impurity band to conduction and valence band transitions, all of which lead to 
absorption and emission of the appropriate photon energies. 
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Fig. 1.19 Schematic diagram of the conduction and valence bands of a direct semiconductor and the 
transitions. 


Fig. 1.20 Schematic diagram of the indirect semiconductor and the phonon-mediated transitions. 


E, 
Donor 
band 


Acceptor 
band 
Ey 


Fig. 1.21 Transitions possible with a semiconductor with impurity donor and acceptor bands: conduction 
band to valence band and impurity band to impurity band are illustrated. Others, conduction 
band to impurity band and valence band to impurity band have not been shown. 
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The absorption rate of the band to band transitions, for both the direct and indirect 
transitions may be calculated using quantum theory, but is not included here. 

Free carrier absorption also occurs in most semiconductors as the carrier density is 
always non-zero. The absorption of a photon by a carrier within a band results in the 
carrier having a larger energy. The absorption coefficient is proportional to the carrier 
density [3]. This effect is important in the design of waveguide devices, where typically 
this may result in absorption of the order of 1dBcm7! when the carrier densities are 
high in the 10!8 cm-? region. 


Exciton absorption 


In pure semiconductors, the absorbed photon with bandgap energy or larger may create 
excitons, which are electron-hole pairs that are bound, and in the binding process give 
up the binding energy. The binding energy of these excitons is of the order of about 
4.5 meV, and at low temperatures, an excitonic absorption peak is seen a little below 
the band to band absorption energy. At room temperature, this peak is not seen in bulk 
material, because the thermal broadening due to optical phonons is comparable, and 
the excitons that are created dissociate very rapidly. In quantum wells, however, the 
excitons remain extant at room temperature due to enhanced binding energies, which are 
typically two or three times that of the thermal broadening energy. Thus, the absorption 
characteristics of the material with quantum wells also show the excitonic absorption 
in addition to the usual band to band absorption. When a transverse electric field is 
applied to the quantum well, the absorption edge shifts to a longer wavelength. A simple 
explanation of this phenomenon is shown in Figure 1.22, where the schematic wave 
functions of the electron and hole in the quantum well are shown. When the transverse 
field is applied, then the quantum well bands tilt, and the resulting gap between the 
electron-hole wave functions decreases, which results in the absorption edge moving to 
a smaller energy and thus a longer wavelength. 


A schematic diagram of the wave functions in a quantum well, and the effect of applying a field 
across the well, resulting in tilting of the wells. This so-called Quantum Confined Stark Effect 
reduces the effective band gap of the material. 
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Absorption coefficient for various semiconductors (M. Shur, Physics of Semiconductor Devices, 
Prentice Hall, 1990 ©Prentice Hall). 


Other absorption mechanisms are due to valence to impurity band, impurity band 
to other impurity band or impurity band to conduction band transition, intraband 
absorption between different levels in the same band, and free carrier absorption. 

The absorption spectra of different semiconductors is summarised in Figure 1.23. 


Recombination and radiation in semiconductors 


The absorption of photons by the semiconductor results in the generation of elec- 
trons and holes, which disturbs the equilibrium status of the semiconductor. Electrical 
injection also results in this non-equilibrium of an excess of electrons in the conduc- 
tion band and an equal number of holes in the valence band. These recombine, both 
non-radiatively and radiatively, the latter in direct gap semiconductors. In general, the 
radiative transitions are dominated by the conduction band to valence band emission 
and therefore define the energy of the emitted photons. Other recombination processes 
include exciton recombination, donor to acceptor and other impurity recombinations. 
The radiation spectrum from recombination is generally shifted to lower energy from 
the absorption spectrum, and this is termed the Stokes or the Franck—Condon shift due 
to imperfections in materials or interfaces. 
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In general the excess electrons and holes decay at some rate, resulting in the density 
varying as exp(—t/t), where t is defined as the lifetime of the carriers. The decay of 
these carriers results in transfer of energy to the lattice in the form of phonons for the 
non-radiative decay and transfer of energy to photons for radiative decay. 

The corresponding lifetimes are labelled as tp; and t, for the non-radiative and radia- 
tive decay, respectively, and the corresponding non-radiative and radiative rates are Rpr 


and R,, respectively. Thus, the total lifetime constant Tt is given as: 
1 1 1 
—-=—++4+-. (1.54) 


T tar Tr 


The corresponding total spontaneous rate of recombination is given by 
Rspon = Ry + Ry. (1.55) 


Devices such as the light-emitting diode (LED) largely depend on spontaneous emis- 
sion, and in this case the internal quantum efficiency is given by 
R, 

Ror + R, ; 

The exponential decay rate of the excess carriers is inversely proportional to recombi- 
nation rate, and if the excess of electron is An, then the recombination rates R; and Ror 
are given by the expressions: R; = An/t, and Ryr = An/Tpr. Then internal quantum 
efficiency may also be written as 


(1.56) 


Ninternal = 


= | Thr 7 
7 . 1 5 
? Jinternal 1 + ai Tr + Tor 


eae ee 
Tr Tor Tor 


The total spontaneous recombination rate is given by the equation: 


Rota = AAn + BAn? + CAn’?. (1.58) 


The first term is the Shockley—Read—Hall recombination due to defects and traps, the 
second is the spontaneous emission due to radiative transition, and the third is the Auger 
recombination term. Auger recombination is non-radiative, occurs at high injection 
levels, and is a three-particle process. It becomes important in ternary and quater- 
nary compounds of InP-based materials, and is evident in the long wavelength laser 
structures. 


Spontaneous and stimulated emission 


The radiative recombination process discussed above occurs spontaneously, and this 
is used in traditional LED structures. In lasers, stimulated emission is the source of 
light, and in this section the relationship between absorption, spontaneous emission and 
stimulated emission, first outlined by Einstein in 1917, is discussed. The derivations 
given here follow the approach outlined by Casey and Panish [4] and Agrawal [1]. 

It can be shown that the blackbody radiation law is given by 


8nn > E2 


= 13c3(eE/*T — 1) 


, (1.59) 
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Transitions from level E, to E> for absorption and from E> to E; for emission, Fy and F> are 
the electron and hole quasi-Fermi levels respectively. 


where 77 1s the index of the material under consideration, / is Planck’s constant, c is the 
speed of light in vacuum, F is the energy given by hv, v is the frequency and k is 
Boltzmann’s constant. This is the expression for the energy density blackbody radiation 
P(E), and is in thermal equilibrium, when the input radiation is equal to the outgoing 
radiation. 

For a semiconductor, consider the transitions from the conduction band to the valence 
band and also the reverse. The energy levels in each of these bands have to obey the 
Pauli exclusion principle, which implies only two carriers at each level. Thus, the band 
is a series of levels, as shown in Figure 1.24, and the transition energy for an electron 
from a level E in the valence band to a level E2 in the conduction band requires that an 
incident photon has energy given by hv = E2 — E. Let the probability of this transition 
taking place be given by Bj2, and let f; be the probability that an electron exists at level 
E, and (1— f2) be the probability that a vacancy occurs at level E. Also assume that the 
radiation density of photon energy incident on the semiconductor is given by P(E2}). 
Then the upward transition rate is given by 


ry2 = By fid — fo) P(E21). (1.60) 


Note that f| and f2 take the form of the Fermi—Dirac distribution 


1 


f= CERT 1" et 


where Fj is the corresponding quasi-Fermi level, k is Boltzmann’s constant and T the 
temperature in Kelvin. 

Similarly, the downward transition rate, now called the stimulated transition, is 
given as 


roi (stim) = Bo; fod. — fi) P(E£21), (1.62) 


where Bp, is the transition probability, f> is the probability that an electron is present 
at Ey and (1 — f1) is the probability that there is vacancy at EF}. 

Finally, there is the spontaneous transition from Ez to E;, without any incident 
radiation involved, given by 


r21(spon) = Az) fo(1 — fi). (1.63) 
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In thermal equilibrium, the input radiation is equal to the output, the Fermi levels F, = 
F, and hence 


rj2 = 21 (spon) + r21 (stim). (1.64) 


Equating, simplifying and noting that P( £21) is the blackbody radiation term, 


Sr? E2 
P(E2) = aca gEalF (1.65) 
Ar fo — fi) 
= 1.66 
Bofid — fo) — Ba fod — fi) ee) 
A2t (1.67) 


~ BygeBalkT — Boy” 


Equating Equations (1.65) and (1.67) and separating them into temperature-dependent 
and temperature-independent terms give the following results: 


A 82> E2 (1.68) 
a= 33 Bai 
and 
Bo, = Bj. (1.69) 


These are Einstein’s coefficients and their relationships with each other. 

The condition under which stimulated emission dominates is an interesting one. This 
requires a non-equilibrium condition in which the presence of incident radiation is 
required. This results in the population densities in the conduction and valence bands to 
be different from the equilibrium condition. For stimulated emission to dominate, the 
stimulated emission rate, 21 (stim), needs to exceed the absorption rate 712. Substituting 
from Equations (1.60) and (1.62), 


Bo fo — fi) P(E21) > Bi fid — f2)P (E21). (1.70) 


Since By; = Bj2, this equation becomes 


fod — fi) > fil — fa). (1.71) 
Substituting for f; and f2 from Equation 1.61, this equation becomes 
eFa—-FiWkT 5 Q(E2—E1)/kT (1.72) 
or 
fo -—F, > E2-£). (1.73) 


This implies that the difference in the quasi-Fermi levels is greater than the emission 
energy of the photon. If the emission is at bandgap energy, then the difference between 
quasi-Fermi levels needs to be greater than the bandgap energy Eg. 
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Carrier transport in semiconductors 


Drift and diffusion are the two mechanisms whereby carriers are transported in semi- 
conductors such that there is current flow. It will be assumed that thermal equilibrium 
will not be disturbed during these processes [2, 8, 10, 15]. 


Drift current 


When an external electric field is applied to a semiconductor, it produces a force that will 
accelerate the electrons and holes in opposite directions as long as there are available 
energy states in the conduction and valence bands. The net drift of charge will produce 
a current which is the drift current. If the electric field is denoted as €, the drift current 
densities for electrons and holes are written as 


JIn(arift) = @NVadr = QMnnE (1.74) 


J pcarift) = FP pdr = qipn€, (1.75) 


where q is the charge on a particle (electron or hole), J is the surface density of current, 
Undr and Updr are the drift velocities of electrons and holes, respectively, and ju is the 
mobility. 


Diffusion current 


Electrons flow from a region of higher concentration to a region of lower concentration, 
producing a flux of electrons and an electron diffusion current which is in the opposite 
direction to the flux. The hole flow is such that the hole flux and the hole diffusion 
current are in the same direction since the holes are positively charged. The diffusion 
current densities for electrons and holes are given by 


dn 
Jnaitf = ¢Dn— (1.76) 
dx 
d 
j8==90, (1.77) 
P q Pdx 


where D, and Dp» are the electron and hole diffusion coefficients respectively. The 
diffusion coefficient is related to the mobility jz by the Einstein relation: 


kT 
p= =. (1.78) 
q 
Hence, 
D D kT 
Sa i, (1.79) 
Ln Mp qd 
Adding Equations (1.74)-(1.77), 
dn d 
J = (quinn + qupp)E + qDa— — gDp—. (1.80) 
dx dx 
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1.17 p-n junction 


When a junction is formed by a p-type and an n-type semiconductor, holes move from 


the p to the n side across the metallurgical junction and electrons move in the opposite 


direction. There are concentration gradients of electrons and holes giving rise to diffu- 


sion. Furthermore, when the electrons leave the n region, positively ionised donor atoms 


remain behind and, similarly, negatively ionised acceptor atoms remain in the p region. 


These ionised donors and acceptors reside on both sides of the metallurgical junction 


and are not mobile. The length of this region increases as the diffusion continues. The 


resultant electric field is directed from the positive charge to the negative charge. This 


field builds up in such a way as to oppose the diffusion of the carriers in both directions 


across the junction. An equilibrium condition is reached and there is no net flow of cur- 


rent across the junction. The ionised region on both sides of the metallurgical junction 


is called the depletion or space charge region. The p—n junction is seen in Figure 1.25. 


The band diagram and space charge distribution of a p—n homojunction are shown in 


Figure 1.26. The general form of Poisson’s equation for an abrupt junction where there 


is an abrupt change in the doping concentration is 


dé 
= 4p —n+ Np —Na), 
dx € 


where p is the hole concentration, n is the electron concentration and Np and Nag are 


the ionised donor and acceptor concentrations respectively. If the metallurgical junction 


is the origin, the depletion region on the p side extends to —xp and on the n side to Xp. 


Poisson’s equation is written for the depletion regions on either side of the metallurgical 


junction as 


dE qd 

a «NM Xp <x <0 
dé 

Se O<x < Xp, 
dx Es 


where €, is the permittivity of the semiconductor material. The regions outside the 


depletion region are neutral regions and hence the electric field is zero. Equations (1.82) 


and (1.83) are solved using the boundary condition on the electric field to get 


p n 
+«— Depletion region —+ 


+e H 
tREE 
+ 
REE 


= Xp 


— >, X 


Fig. 1.25 p—n junction at equilibrium. 
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p-type n-type 


~<t— Diffusive electron transport 
Electron field transport ——> 


<t—\ Hole field transport 
Diffusive hole transport —-»> 


p(x) 


Fig. 1.26 p—n junction without externally applied voltage — band diagram (centre) and space charge 
distribution (bottom). 


N. 
E=-TAGixn), xy <x <0 (1.84) 
Es 
N 
fas 25h O<x < Xp. (1.85) 
és 


The potential is related to the electric field by the equation 


E= aa (1.86) 
dx’ , 
Integrating Equations (1.84) and (1.85) 
N, 
Vix) = LA@4+ ap), xp <x <0 (1.87) 
2€s 
Ni 
VQx)= _ Pix—xa)’, O<x<m. (1.88) 
Es 
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The built-in potential 


The built-in potential on the p side of the junction is the potential difference across the 
depletion region. It is determined similarly on the n side. 


qNa 


Voip = “5 (1.89) 
Ss 
Np 
Vig S x2, (1.90) 
Ss 
The total built-in potential across the junction is 
Voi = (Voip + Voin) (1.91) 
q 
= 5<,lNaxp + Npx?]. (1.92) 
Ss 


The continuity of the electric field across the junction at x = O requires that 
Naxp = NpXn. (1.93) 


It is assumed that the dopants are fully ionised and the total ionised positive charge 
per unit area on the n side is equal to the total ionised negative charge per unit area on 
the p side. At thermal equilibrium, there is no net current flow and hence the drift and 
diffusion currents are equal. The electron current is 


Jn = 0 (1.94) 
= Jnaritt + Jnaift (1.95) 

dn 
= qunnE + qDn—. (1.96) 

dx 

The hole current is written as 

Jp =0 (1.97) 
= Jparitt + Jpaitt (1.98) 


dp 
= E—-—qDp—. 1.99 
Ghgpe =a (1.99) 
When the net hole current is zero, and with the electric field equal to the gradient of the 
potential, it may be shown that 


(1.100) 


1 
It has been assumed that there is full ionisation of the dopant impurity levels such that 
the majority carrier concentrations are the doping concentrations and the equilibrium 
concentrations are related by 


no po = ne. (1.101) 


The depletion layer width 


The widths of the depletion layer in the p- and n-type semiconductors may be calculated. 
The maximum electric field occurs at the metallurgical junction, x = 0. This is given by 


€sEmax = @NpXn = QNaXp. (1.102) 
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Using the Equations (1.92) and (1.93) with 


Emax 


| Voi | = an + Xp (1.103) 
W=X%1+Xp (1.104) 
- (c+a) | Voi |- (1.105) 

q \Nd Na 


The depletion capacitance 


The depletion capacitance is the capacitance at the p—n junction. The depletion layer is 
modelled as a parallel plate capacitor. The capacitance is written as 
€sA 
GSS 1.106 
i= Ww ( ) 
where A is the area of the p—n junction and W is the depletion layer width given by 
Equation (1.105). The junction capacitance is given by 


NAN 
G24) (1.107) 
2Voi(Va + Np) 


At thermal equilibrium, the total electrostatic potential across the p—n junction is the 
built-in potential, Vp;, and the potential difference between the p and n regions is q Vpj. 
If now a voltage Va is applied with the positive terminal connected to the p side and the 
negative to the n side, the junction is forward-biased and the total electrostatic potential 
across the junction is V4; — Va, resulting in a reduction of the depletion layer width. 
A potential barrier was formed at thermal equilibrium restricting the motion of the 
majority carriers. The application of the forward bias reduces the height of the bar- 
rier. If, on the other hand, a voltage is applied with the positive terminal connected to 
the n side and the negative terminal to the p side, the electrostatic potential across the 
junction is Vp; — (— Va) and the height of the barrier is increased with the reverse bias. 
The depletion widths and the energy band diagrams are shown in the figure. The width 
of the depletion layer is given by 


ie [= (2) (Ve eV): (1.108) 


p—n junction under bias 


q NaNp 


Current-voltage characteristics 


The total current density in a p—n junction is given as: 


Dynpo Dp Pro qVa 
= —]-l], 1.109 
a| Tt Ey | |S? (“er (1.109) 
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where 
Ly = VDath (1.110) 
Lp = V Dp (1.111) 
n- 
Ny = — (1.112) 
P Na 
nm (1.113) 
j= =; : 
Po Np 
The current density expression now reduces to 
qVA 
J= J =} =I] 1.114 
ofexw (SE) -1] (1.114) 
where 


J 2) {Pa 1 4 aes (1.115) 
=gn: ‘ F 
cae T Na Tp Nod 


where Dy and Dy are the Einstein coefficients for electrons and holes and Tt, and Tp are 
the electron and hole lifetimes. 


Schottky diode 


Another important type of semiconductor junction is the Schottky diode, a metal— 
semiconductor junction. 

Its band diagram is shown in Figure 1.27, assuming that the semiconductor layer 
is n-doped, that the doping concentration is constant throughout the layer (homoge- 
neous doping), and that the Boltzmann approximation for the Fermi distribution is 


Metal n-semiconductor 


y=Yo 


Band diagram of a Schottky contact on a homogeneously n-doped semiconductor layer, without 
an externally applied voltage. The bottom graph shows schematically the extension of the space 
charge region. 
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valid. We see that under ideal circumstances, the Schottky barrier height gp is the 
difference between the metal work function gm and the electron affinity xp. In prac- 
tice, however, the effective Schottky barrier height @p is also influenced by states at the 
metal—semiconductor interface and shows only a small dependence on the metal work 
function. On GaAs, gp © 0.8 eV. 

The built-in voltage can be easily calculated: 
Nc 
Np’ 
where Nc is the density of states in the conduction band and Np the semiconductor 
doping concentration. 

The Schottky barrier gp causes a depletion of the semiconductor layer immedi- 
ately adjacent to the metal—semiconductor interface. Devoid of electrons, the positively 


Voi = Ob — kT In (1.116) 


charged ionised donors remain and form a space charge region with a space charge den- 
sity e = qNp. It is indicated in the bottom graph of Figure 1.27 — we make the usual 
“box shape’ assumption for the space charge region, i.e. we assume the space charge 
concentration to be constant throughout until y = yo, then ending abruptly. 

For a homogeneously doped semiconductor with a permittivity of €, and a doping 


concentration of Np, 
2€s Voi 
o= (1.117) 
: \ gNp 


without any externally applied voltage. 


Heterostructures 


The ability to mix semiconductors of different chemical composition in a single 
crystal gives an important degree of freedom in device design. The combination 
of semiconductor materials of different stoichiometry in a single crystal is called a 
heterostructure. 

Of foremost interest is the ability to change the energetic width of the forbidden gap — 
the bandgap energy, or band gap in short. This is shown in Figure 1.28 for some popular 
atomic and binary semiconductor materials. We note that changing the stoichiometry 
not only modifies the bandgap energy, but also the lattice constant, which can be under- 
stood as an average distance between the atoms in the crystal. This change in lattice 
constant is a major complication when designing devices, but we will exclude it for 
now by considering the material system (Al,Ga)As, where the lattice constant is almost 
independent of stoichiometry. 

The ability to change bandgap through stoichiometry opens up several interesting 
design options. 


e We may, for example, introduce a built-in electric field for one carrier species, but not 
for the other — schematically shown in Figure 1.29. This is a p-type semiconductor 
which has a smaller bandgap on the left-hand side (£1) than on the right-hand side. 
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Bandgap energy and lattice constant for several popular semiconductor materials. 


Hypothetical band diagram of a p-doped graded heterostructure. 


This could be done by starting with GaAs and gradually increasing the aluminium 
content while progressing to the right. Due to the p-type doping, the valence band 
stays approximately equidistant to the Fermi energy Ep (neglecting the change in 
the valence band density of states Ny), which is constant in thermodynamic equilib- 
rium.! Due to the change in bandgap energy, the conduction band energy will change 
strongly and provide a built-in drift field for electrons, which in this schematic will be 
accelerated from right to left. 

We will later use such a structure to accelerate the electrons in the base of a 
heterostructure bipolar transistor. 
Or we may abruptly change the bandgap by an abrupt modification of the stoi- 
chiometry (see Figure 1.30). In this case, we use an n-type semiconductor material, 
so the conduction bands remain approximately lined up horizontal (save for the 
stoichiometry-induced change in the conduction band density of states), but the 
change in bandgap results in a significant additional energy barrier for holes, which 
will keep them from moving from region | into region 2. This carrier confinement 
is at the heart of any semiconductor laser structure, and will also be used in het- 
erostructure bipolar transistors. Note that a comparable energy barrier does not exist 
for electrons — an energy barrier has been created for one carrier species only, a feat 
possible only through the introduction of semiconductor heterostructures. 


The picture is a simplification because changing the stoichiometry also changes the density of states in the 
valence band, so there will be some variation in the Ep to Ey distance, which has been omitted. 
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Energy band diagram of an abrupt transition between two materials in a semiconductor 
heterostructure. 
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Constructing heterostructure band diagrams using Anderson’s rule. 


Constructing heterostructure band diagrams 


To efficiently use heterostructure band diagrams in the understanding of high-speed 
electronic and optoelectronic devices, we have to be able to construct their band 
diagrams. A simple procedure shall be described here. 

It uses a modification of Anderson’s rule, which has been a proven technique to draw 
the band diagram of the important SiO2—Si interface. Anderson’s rule postulates that 
the vacuum energy level is continuous and uses the electron affinities, i.e. the ener- 
getic spacing between the vacuum level and the conduction band, to calculate the band 
alignment in an abrupt heterostructure. The electron affinity x is material-dependent. 

When constructing the band diagrams, see Figure 1.31; we start out with the Fermi 
energy Ep, which in thermodynamic equilibrium is constant throughout the structure. 
We next draw conduction and valence bands far away from the interface between the 
two materials, which requires knowledge of the doping type and concentration, the 
bandgap energy and the densities of state as described in the introductory review section. 
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Next, we use knowledge of the electron affinities x in the two materials to draw the 
vacuum level, again far away from the interface. Poisson’s equation can be used to cal- 
culate the exact potential in the interfacial region, which is used to draw the continuous 
vacuum energy level. The material properties remain constant right up to the interface, 
so we can draw the conduction and valence bands to be perfectly parallel to the vacuum 
level. 

The resulting conduction and valence bands show discontinuities at the interface 
between the two materials, in direct consequence of the the different electron affinities 
and the continuity of the vacuum energy level. 

The built-in potential Vj; can be calculated as follows (refer again to Figure 1.31). 
We assume here for simplicity’s sake that the Boltzmann equation can be used in lieu 
of the Fermi—Dirac function: 


(1.118) 


Nv iNew 
qa: Voi = na — a+ Bo + RT in (SEE, 


NatNp 


Assume that you externally apply a voltage —Vpj to the structure, which compensates 
the built-in potential — the energy bands would then be constant within the two regions 
of the heterostructure (flatband condition); now you easily recognise that 


AEc = x1- x (1.119) 


and 


AEy = Eg — Eo, — AEc = AE — AEc. (1.120) 


However, in reality, the experimentally determined band discontinuities of heterostruc- 
tures do not agree well with the values predicted using the electron affinities. This has 
to do with the different interface conditions between a free surface (used to determine 
the electron affinities) and a semiconductor heterostructure. 

Anderson’s rule can still be used, however. Note that only the difference of the elec- 
tron affinities matters, not their absolute values. Hence, you can place the vacuum 
level at an arbitrary distance from the conduction band when drawing the band dia- 
gram, provided that the difference in the hypothetical electron affinities agrees with the 
experimentally determined AEc. 


Band line-up 


Abrupt heterostructures can have three fundamental band alignments — refer to Figure 
1.32 for a schematic representation: 


(i) In a type J heterojunction, the conduction band of the material with the lower 
bandgap is below the conduction band of the material with the higher bandgap, 
but the valence band of the lower-bandgap material is above the valence band of 
the higher-bandgap material. The smaller bandgap is hence fully within the larger 
bandgap. 
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Schematic representation of heterostructure line-ups. 


(ii) In a type 2 heterojunction, the conduction band and the valence band of one mate- 
rial are below their counterparts in the other material. This is sometimes also 
referred to as a staggered line-up. 

(iii) Finally, in a type 3 heterojunction, the valence band in one material is above the 
conduction band in the other. This is called a zero-gap configuration. 


In current practical devices, the type | line-up is by far the most common. 


Lattice mismatch 


As already observed in Figure 1.28, changing the stoichiometry will generally also mod- 
ify the lattice constant. When combining materials with different lattice constants, the 
mismatch will create strong mechanical strain at the interface (experienced as tensile 
strain by the material with the smaller lattice constant and as compressive strain by 
other materials). 

Please refer to Figure 1.33. We shall visualise in a schematic fashion the problem of 
lattice mismatch, taking the important Si/SiGe heterostructure as an example. 

In this example, a thin SiGe layer shall be sandwiched between two thick Si layers.” 
Provided that the thickness of the SiGe layer remains below a certain critical thickness, 
all of the lattice mismatch can be compensated for by elastically straining the thin layer 
(and a small fraction of the neighbouring layers). This case is also called the pseudo- 
morphic case, a term which we will encounter later on, e.g. in the discussion of modern 
field effect transistor (FET) structures. 

If, however, the strained layer thickness is significantly increased beyond the critical 
thickness, the mechanical forces at the interface become so large that they are able to 
break crystalline bonds — a crystal defect is created which will have detrimental effects 
in both optoelectronic and electronic structures and is hence to be avoided (but for a 
very few special cases, where this plastic strain relaxation is used deliberately in areas 
of the device devoid of mobile charge carriers). 

Hence, the critical thickness is a parameter which must be carefully obeyed in het- 
erostructure device design. It depends on both the lattice mismatch and the elements 
forming the heterostructure. 


? ‘Thin’ here means that the whole layer can be deformed by the mismatch-generated strain, while ‘thick’ 
means that the largest part of the layer remains unstrained. 
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Schematic representation of lattice mismatch in heterostructures. 


This introductory section on heterostructures concludes with what 2000 Nobel lau- 
reate Herbert Kroemer has called the Central Design Principle of semiconductor 
heterostructures: 


Heterostructures use energy gap variations in addition to electric fields as forces acting on holes 
and electrons to control their distribution and flow [7]. 


Our future treatment of high-speed electronic and optoelectronic devices will only 
exemplify this fundamental observation. 


Silicon—germanium heterostructures 


Until the late 1980s, practical electronic and optoelectronic devices, if they included 
compound semiconductor materials, were dominated by materials combining elements 
from the third and fourth columns of the periodic table of elements, such as GaAs or InP. 
However, compound semiconductors may also be formed from elements in the fourth 
column — elements which already are semiconductors. One example is SiC, which has 
been investigated extensively as a blue-light emitter. As an aside, SiC under its old trade- 
mark of ‘carborundum’ was the first semiconductor material for which light emission 
was ever observed and is finding renewed interest for high-power transistors [1 1]. 

The currently most important group IV-IV semiconductor material, however, is the 
silicon—germanium alloy which shall receive special treatment here commensurate with 
its importance. 
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Table 1.5 Basic properties of silicon and germanium 


Property Silicon Germanium Unit 


Lattice type Diamond Diamond 
Lattice constant 0.5431 0.5657 nm 
Direct band gap 3.40 0.80 eV 
Indirect band gap 1.11 0.664 eV 
Indirect bandgap direction {100} {111} 
Relative dielectric constant 11.9 16.2 
1 
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Critical thickness of a strained SiGe layer on Si. 


Table 1.5 [13] summarises some fundamental properties of silicon and germanium. 
As we see, both are indirect semiconductors. Of special interest is the difference in band 
gap, which will allow to build efficient heterostructure devices. However, the significant 
difference of the relative dielectric constant (and consequently the refractive index) has 
also been used in photonic waveguiding devices. 

We also note that the lattice constant is quite different — the ratio of lattice constants is 


BOP 2 4042 


asi 


for a mismatch of 4.2%. 

Therefore, any SiGe heterostructure will be strained, and the strain has significant 
consequences. In most cases, a pseudomorphic heterostructure free from crystal defects 
will be desired, and hence the critical layer thickness has to be obeyed. 

There is, however, already some discussion about the value of the critical layer thick- 
ness. Refer to Figure 1.34 adapted from [5], which plots the critical thickness of a 
strained Sij_,Ge, layer on relaxed silicon. This is the case found in Si/SiGe heterostruc- 
ture bipolar transistors (HBTs), where the base is formed from a SiGe alloy, while the 
emitter and collector layers are silicon. 
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There are two substantially different curves for the critical thickness as a function of 
the germanium mole fraction x. 


(i) The equilibrium theory limit constitutes a safe upper limit in that strained lay- 
ers grown with combinations of Ge content and thickness will remain stable even 
under high-temperature processing. 

(ii) The People fit [9] takes experimental data into account. Here, depending on the 
growth temperature, much higher products of layer thickness and Ge mole fraction 
will still result in defect-free layers, provided that they do not encounter high- 
temperature processing after layer deposition. 


The region between the equilibrium-theory and People-fit curves is called the 
metastable region. Many practical SiGe devices possess Ge fraction and layer thickness 
combinations in this region. 

The strain also very substantially affects the band structure of SiGe heterojunctions. 
This is important to understand the concepts of SiGe heterostructure devices further 
down. In the above example of an elastically strained SiGe layer on a relaxed silicon 
layer, the heterojunction forms a type | interface, i.e. the SiGe band gap is located ener- 
getically in the forbidden gap of silicon. Most of the bandgap difference is reflected 
in the valence band discontinuity on an abrupt heterostructure. If, however, we place 
a thin strained Si layer on a relaxed SiGe layer, the heterojunction will be of type 2, 
the staggered line-up configuration, with the Si conduction band below the SiGe con- 
duction band. The resulting conduction band discontinuity will further down be used to 
construct n-channel Si/SiGe heterostructure field effect transistors (HFETs). 

The effect of layer composition has been compiled by Schiaffler [12] into a convenient 
chart, shown in Figure 1.35. 

Its use shall be demonstrated using the following example. Suppose you are looking 
for the band alignment of a strained Sip; Geo.9 layer on relaxed Sig.4 Geog. In this case, 
Xrelaxed = 0.9 and Xstrained = 0.6. In this case, we read AEc =O and AEy = +220 meV. 
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Conduction band (A EF) and valence band (A Ev) discontinuities of Si; __,Ge x heterostructures, 
depending on the relaxed-layer and strained-layer Ge mole fraction x. The discontinuity values 
are listed in meV. Graphs adopted from Schiffler (1995), Ref. 12. 
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Electron mobility enhancement in strained-Si MOSFET channels [6]. Used with permission 
from Oerlikon Systems. 


Another interesting effect of strain in Si/SiGe heterostructures is that it modifies 
mobility, an effect investigated for performance enhancement in Si MOSFETs. Here, 
a thin silicon layer is deposited on top of a relaxed SiGe buffer. The SiGe buffer does 
not carry any current, but provides a means of introducing a tensile lateral strain into the 
silicon channel layer above. Depending on the electric field in the channel, a mobility 
enhancement of 30% or more may result, compared to the Si universal mobility curve 
(see Figure 1.36). [6] 


Problems 


(1) Consider a three-dimensional potential well which is infinite such that the 
following conditions are satisfied: 


VQ, y,z) =0,-a<x<a,-a<y<da,-a<z<a, (1.121) 


Outside the well 
Vix, y,zZ) =O. (1.122) 


Show that the energy is quantised and determine its form. 

(2) (a) Determine the amplitudes of the transmitted and reflected waves in the 
finite potential barrier. Use these results to determine the transmission and 
reflection coefficients and hence the tunnelling probability (also called the 
transmissivity) given in Equation (1.25). 

(b) A potential barrier has width a = 30A and height Vo = 0.5 eV. An electron 
is incident on this barrier. Sketch the transmissivity function. 


More free ebooks : http://fast-file.blogspot.com 


44 


High-Speed Electronics and Optoelectronics 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


(1) 


(c) The barrier width is reduced to 5A. Sketch the transmissivity function. 
Comment on the difference as the barrier width is reduced. 


For the Kronig—Penney model, obtain the solution of the transcendental equation 
and show the allowed energy bands (MATLAB is useful in this problem). 


A sample of GaAs is doped with a background concentration of Na = 3 x 
105 cm-3 acceptors. Then, 10!9 cm~3 donors are added. Find the room tem- 
perature concentrations of electrons and holes in the original material and the 
material that results after the addition of the donors. Draw the band diagram for 
each of the materials. 


A silicon ingot is doped with 10!’ cm~? arsenic atoms. Find the carrier concen- 
trations and the Fermi level at 300° K. Assume complete ionisation of impurity 
atoms. Draw the band diagram. Show the Fermi level and use the intrinsic Fermi 
level as the energy reference. Repeat for a temperature of 77° K. 


(a) The energy band gap for GaAs at 300K is 1.42eV. Assume that the Fermi 
energy level is at mid-band. Calculate the probability that an energy state at 
E = E,+kT/2 is occupied by an electron. 

(b) What is the probability that the energy state at E = Ey — kT /2 is empty? 

(c) Using the density of states function g(£), determine the volume density of 
states between 0 and 2 eV. (Integrate g(e)dE.) 


A sample of InP is doped with 5 x 10!° tellurium atoms/cm? and 2 x 10!5 cm? 
zinc atoms. Calculate the electron and hole concentrations. 


A sample of GaAs is illuminated with light of intensity 1.5 mW cm7 at a wave- 
length of A = 470 nm. The area of the illuminated surface is 15 mm/?. If the carrier 
lifetimes are 10 ns, how many photons are incident on the surface of the GaAs 
sample? If there is uniform absorption of the photons within | j1m of the sample 
surface, determine the equilibrium concentrations of electrons and holes in the 
dark and the excess carrier concentrations when the sample is illuminated. 


(a) An n*-p diode has a long p region. The current J is measured at a con- 
stant forward bias of 500 mV. Derive an expression for the fractional change 
in current AJ/J resulting from a temperature change of AT. Evaluate this 
expression at T = 300K. Assume that the diode current is dominated by 
recombination of electrons in the neutral p region and consider the temper- 
ature dependence to be due solely to the factors that have an exponential 
dependence on temperature. 

Suppose that part of the diode current results from recombination in the 
depletion region. How will this affect the sensitivity of the temperature 
measurement? 


(b 


we 


A Silicon p-n junction has 5 x 10!” cm~? donor atoms and 4 x 10!9 cm7? acceptor 
atoms. The junction area is 150\1m?. Determine the junction capacitance given 
that the reverse bias is 5 V. How does this change for the same forward bias? 


It is found experimentally that a Schottky barrier is formed when Al is evaporated 

on n-type GaAs with a donor concentration of 4 x 10!>cm7?. Al has a work 

function of 4.36 V and GaAs has an electron affinity of 4.07 V. 

(a) Determine the built-in voltage Vpi, the depletion layer width W and the 
capacitance at zero bias, assuming no surface states. 
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(b) If the Fermi level at the surface is pinned so that 


1 
qpo = 3 Ps 


q¢p = (Eg —4q¢o), 


determine the same quantities as in part (a) with a reverse bias of 1 V. 

(12) An n*+-GaAs/p-AlGaAs heterojunction is formed with Np = 10!°cm~? and 
Na = 5 x 10!7cm7?. Draw the band diagram and determine the following 
quantities: 

(a) The electric field €(x) and the potential difference V(x) at the junction. 

(b) The built-in potentials on either side of the junction and the total built-in 
potential, Vpji. 

(c) The depletion widths in the two regions. 

(d) If a forward bias of 3 V is applied, calculate the depletion widths. 

(e) Calculate the capacitance of this diode. 

(f) Calculate the current in the diode. 
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Executive summary 


This chapter introduces the active devices commonly used in high-speed electronics. It 
starts with a discussion of the metal—-semiconductor field effect transistor, or MESFET — 
historically the oldest FET concept, which for decades was the most prominent device 
in microwave electronics. Its pitfalls led to the development of an advanced transistor 
structure, the high electron mobility transistor (HEMT). It incorporates heterostructures 
to gain additional freedom in device design. HEMTs mostly replaced MESFETs in 
micro- and millimetre-wave applications. 

Metal-oxide-semiconductor field effect transistors (MOSFETs), which dominate dig- 
ital electronics, are rapidly making inroads at microwave and even millimetre-wave 
frequencies. They will be discussed as well, and we will recognise similarities between 
HEMTs and MOSFETs in the physics of the intrinsic transistor. 

Finally, bipolar junction transistors (BJTs) will be introduced, showing how a 
dilemma in the optimum design of the base layer led to the invention of the hetero- 
junction bipolar transistor (HBT) — again, heterostructures come to the rescue. 

For all these components, the chapter will discuss their fundamental physical oper- 
ation, non-ideal and parasitic effects, and linear and non-linear models, as well as 
examples in several material systems. 


MESFET 


Introduction and current control mechanism 


The metal—semiconductor field effect transistor (MESFET) is conceptually the simplest 
of the commonly used transistor structures and shall therefore be discussed here first. 
The fundamental idea is quite straightforward: the current flowing through a slab of 
semiconductor material (from now on called the channel) depends on three fundamental 
parameters for a given externally applied voltage: 


(i) velocity of charge carriers, 


(ii) density of charge carriers, 
(iii) the geometric cross-section the carriers flow through. 
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15-5 


1] +-{1 [I \ omer 


Lilienfeld’s FET concept, from his US patent application in 1926. 
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\ 


Semi-insulating GaAs substrate 


(b) 


Simplified cross-section of a MESFET with (a) an epitaxially grown channel, (b) fabricated 
using ion implantation. 


While the carrier velocity will depend on the local electric field, in the simplest case 
the density of charge carriers is given by the doping concentration. The channel cross- 
section can be influenced externally, if we constrict the current flow using the depletion 
region of a diode. This method was recognised very early and is the object of a patent 
filed in 1926 by Julius Edgar Lilienfeld [35]. Lilienfeld’s concept (see Figure 2.1), 
already used a metal—semiconductor junction to control the current flow, but was never 
realised. The practical realisation of the MESFET is predated by the silicon junction 
field effect transistor (JFET), which uses a p—n diode as the controlling element and 
was first described by Shockley [57]. 

Figure 2.2 shows two somewhat simplified cross-sections of what a MESFET looks 
like. The layer structure in Figure 2.2(a) is defined by epitaxial growth. Above a semi- 
insulating substrate, a thin undoped buffer layer is grown to improve the interface 
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quality, then the channel layer follows whose doping concentration and thickness are 
very important design parameters, as we will shortly see. Above it, we find a highly 
doped contact layer, intended to improve the formation of non-blocking contacts and to 
reduce series resistances between the source and drain contacts and the channel region, 
but whose exact thickness and doping concentration have no bearing on the fundamen- 
tal properties of the transistor. Below the gate contact, the contact layer is etched away 
to allow the blocking Schottky contact to contact the channel layer directly. 

Figure 2.2(b) shows a very similar structure; only now the differently doped semicon- 
ductor regions are formed by ion implantation. This results in lower cost, however; the 
lattice damage caused by the ion bombardment will negatively impact carrier velocity 
and also lead to an increase in low-frequency noise. This will not be discussed in detail 
here. 

In both cases, it is assumed that the carrier species in the channel are electrons 
(n-channel), as this is the more common variant; however, p-channel devices can be 
fabricated with equal ease. 

While a MESFET can be structured on many different semiconductor materials, only 
devices fabricated in GaAs and in SiC are commercially relevant. The GaAs MESFET 
was, for many years, the mainstay of microwave solid-state electronics and shall be 
discussed here, while the SiC MESFET with its excellent thermal properties and high 
breakdown voltages is used predominantly in power amplifiers for mobile phone base 
stations. 

For the benefit of clarity and to obtain analytic expressions, we will simplify the 
structure even further. Figure 2.3 shows the three-dimensional view of the simplified 
structure. First of all, note the coordinate system which will be used similarly through- 
out. The x axis is parallel to the ‘long’ extension of the gate stripe. The y axis is 


Gate length 
Le 


Source =|» Drain 


One-dimensional channel 


Insulating substrate 


y 


Simplified MESFET structure used in the analytic calculations. 
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perpendicular to the semiconductor surface, while the z axis is parallel to the sur- 
face in the direction of the ‘short’ extension of the gate. The ‘long’ gate dimension 
in x-direction is called the gate width Wc, while the gate length Lg is the extension in 
the z direction. 

We assume now that the channel is one-dimensional — the electric field in the channel 
has only a z component. To neglect the electric field in the x direction is generally 
justified as WG > Lg, but to neglect the electric field in the channel in the y direction 
is a simplification. 

Another important simplification in the channel is the gradual channel approxima- 
tion. In general, current flow in semiconductor devices can be driven by the electric field 
(this is the drift current) or by concentration gradients — this is the diffusion current or a 
combination of both. Here, we assume that the drift current entirely dominates and the 
diffusion current can be neglected. 

The gate electrode forms a blocking contact with the semiconductor layer under the 
channel, a Schottky diode, which was discussed already in Chapter 1. 

Figure 2.4 shows only the cross-section of the device. The source electrode shall be 
the reference potential, hence Vs = 0. 

The gate electrode potential with respect to the source is the gate-source voltage Vcs. 
In an n-channel device, where the channel layer is n-doped, it will generally be negative 
to maintain the gate-channel diode in a blocking state. The drain-source voltage Vps in 
an n-channel device will be positive. ! 

The extension of the space charge region, shown schematically in Figure 2.4, depends 
on the local gate-channel voltage Vg. We find for Vg(z): 


Ve(z) = Ves — V(@), (2.1) 


Vos >0 


MESFET channel with space charge region for small drain-source voltages. 


' Because our structure is symmetric with respect to the non-blocking contacts, this defines the drain — in an 
n-channel device, the drain is the contact with the higher potential. 
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where V(z) is the voltage drop in the channel between point z and source. As the drain 
is at a higher potential than the source, V(z) > 0, the gate-channel voltage becomes 
more negative as z increases. 

At point z, the extension of the space charge region is 


: - ee Vo(2)] = | tte Vi) 
(z) = = 
qNp aNp 


(99) 


with Np the channel doping concentration, assumed to be constant throughout the 
channel. 


Drain current using a constant-mobility assumption 


Let us first consider small Vps, such that h(z) < a, with a the channel thickness for all 
0 < z < Lg — there is always an undepleted part of the channel remaining. We will 
now calculate the channel current. 

The channel current is always calculated in the same fashion: by multiplying the 
moved charge density (here, g Np), the cross-section through which it is moved (here 
Wola — h(z)]) and the charge velocity. For low fields, the charge velocity can be 
calculated from the electron mobility jz, and the local electric field, here dV (z)/dz. 
Therefore, we find an expression for the channel current as a function of the z 


coordinate: 
dV(z) 
T(z) = qNpWela — h(z) en — (2.3) 
As a consequence of Kirchhoff’s law, we know that the charge current entering at the 


source will be equal to that leaving at the drain — this is called current continuity. It all- 


Ows us to eliminate the z dependence of the current through a simple mathematical trick. 
As I(z) = const = Ip, obviously i I(z)dz = IpLc. 
Consider further that 


Dak —_ 2€s5 _ 
h’(zZ)= a [Voi — Vos + V(z)I. 


Differentiating both sides with respect to z leads to 
dh(z) es dV(z) 


2h(z = 
@) dz qgNp dz 
and finally 
dV(z N dh(z 
@) _ AND pd) 
dz Es dz 
Through parameter substitution, we find 
z=Lo 2N2W, h(Lg) 
ees i@a; 2 / h(z)la — h(z)ldh. 
Le Jz=0 ésLe n(0) 


As V(O) = 0, A(O) = ba (Vbi — Vas). Incidentally, the necessary gate-source 


voltage to fully close the channel at the source end is the pinch-off voltage Vp: 
29Nb 


Vp = Vpi — a“ —. (2.4) 
2€s 
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Using Vp, we can write Equation (2.2) in the following form: 


h@) = a Vie) Nes = Voi (2.5) 


Voi — Vp 


The required value (Lag) is now found very easily — we know that V(z = LG) = Vps 
and therefore 


Vos — Ves — Voi 
iestase/ =. (2.6) 
i Voi — Ve 


We can now finally solve the current integral using the constant-mobility assumption, 
and find for the drain current: 


In92 3 
q° N5iuna? Wo 
Ip(Vcs, Vos) = —2* > (2.7) 
6e,LG 
3Vps (Vps — Vos + Voi)?/* — (Voi — Vas)?” 
Voi — Vp (Voi — Vp)3/2 : 


We had so far assumed that the channel would remain at least partially open. This 
requires that h(Lg) < a. From Equation (2.6) we find that this translates into 


Vos < Ves — Ve = Vx, (2.8) 


where V is the knee voltage. 
For 


e Vos < Vx the MESFET is in the linear regime, while for 
e Vps > Vy it is in the saturated regime. 


Figure 2.5 shows simulated output current-voltage characteristics for a hypotheti- 


cal MESFET with a pinch-off voltage Vp = —2V and a built-in voltage of Vbi = 
0.7 V, calculated using Equation (2.7). The drain current has been normalised to 
q Npuna® / 6¢sLc). 


Note that for very small Vps, the dependence of Jp on Vpg is almost linear. MES- 
FETs can be used as electronically controllable resistors, e.g. in variable microwave 
attenuators or in transmit/receive switches. 

For Vps — Vx, the drain current saturates. A common assumption in simple FET 
models is that for Vps > Vx, Ip(Vps > Ve) = Ip(Vps = Ve) = const. 

Conceptually, current continuity in the constant-mobility model requires that for 
increasing z, the electric field increases to compensate for the decrease in undepleted 
channel height. Near Vps = Vk, a — h(z) > O would imply dV(z)/dz — oo at the 
drain end, which is a fundamental flaw of this model. It is still valuable to investigate 
MESFET behaviour at low Vps. 


Constant-velocity approximation 


In all semiconductor materials, the assumption that the carrier velocity increases linearly 
with increasing electric field, i.e. that mobility is a constant, only holds for small electric 
fields. For large electric fields, the carrier velocity becomes independent of the electric 
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Simulated output /—V characteristics of a MESFET using the constant-mobility assumption 
(Vp = —2V, Vpi = 0.7 V). 


field (in good approximation); its value is then called the drift saturation velocity, Vsat.n 
for electrons Or Usat,p for holes, respectively. 

Let us now boldly assume that the charge carriers in the channel reach their drift 
saturation velocity immediately after entering the channel at the source side. 

The channel current in our n-channel MESFET now becomes 


T(z) = qNpUsat.nWo[a — h(z)] = const = Ip, 


due to the current continuity requirement. As the carrier velocity is now constant, this 
implies that the channel height must also be constant: h(z) = const = h. Hence, 


Ip = QNDUsat.nWo(a — h). (2.9) 


The extension of the space charge region can be easily calculated at z = O fora 
homogeneous channel doping profile: 


Voi — Ves 


h = h(0) = a, | ————.. (2.10) 
Vbi — Ve 
Inserting this into Equation (2.9), we find 
Voi — Vas 
Ip = dNDUsat,n Waa (: = : (2.11) 
Vbi — Vp 
The value for Vgs = 0 is referred to as [pss: 
1 
Ipss = qNDUsat,n Wea | 1 — ae |" (2.12) 
Wi 


The constant-velocity model is a priori only valid for high electric fields in the 
channel, in the saturation region of MESFET operation (Vps > Vx). For very small 
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Vps, Equation (2.7) still holds, and the electric field will not ‘jump’ close to source, 
in every case there will be a region close to source where the constant mobility approxi- 
mation is more appropriate. More realistic models of MESFET operation will therefore 
have to combine the constant-velocity and constant-mobility approaches, as was first 
pointed out by Pucel, Haus and Statz [46]. This is, however, beyond the scope of this 
introduction. 

In a technical MESFET with short gate length and in saturation region, the constant- 
mobility regime is restricted to an area very close to source, and the majority of the 
channel is velocity saturated. Equation (2.7) is, therefore, a good approximation for 
Ip(Vgs > Vp) in saturation. 

In practical cases, the onset of saturation is not due to h(Lg) — a, the channel 
pinching off at the drain end, but due to the onset of velocity saturation in the chan- 
nel. This occurs much earlier, and hence the FET will pass from the linear to the 
saturated regime at significantly lower Vps than predicted by the constant-mobility 
model. 

The discussion so far was restricted to MESFETs with constant doping concentra- 
tion in the channel. Often, however, Np varies in the channel in the y direction. Two 
important examples are as follows: 


(i) The ion-implanted MESFET (see Figure 2.2(b)). Here, the doping concentration 
varies according to 


Q y— Rp\* 
nt ae o| (re) |: 
where Q is the implanted dose, o the standard deviation and Rp the projected range. 
(ii) The pulse-doped MESFET, where only a fraction of the channel is highly doped 
(see Figure 2.6). The discussion of the pulse-doped MESFET is interesting because 
it has a distribution of mobile carriers similar to that of the HEMT which will be 
discussed further down. 


Nb 4 


Np2 


Noi 


y 
0 da 


Channel doping profile of a pulse-doped MESFET. 
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The general procedure to handle these non-uniform doping profiles is as follows: 
Poisson’s equation is used to obtain a relationship between the potential in the y 
direction and the space charge distribution Np(y): 


aV(y) 
dy? ~ 


q 
=— Nol): 
Es 
Integrating Poisson’s equation twice yields the required relationship between the gate- 


channel voltage and the extension of the space charge region. The current is now found 
by integrating the free carrier concentration over the undepleted channel cross-section: 


a 
p= 4 vsanWe f No(idy. 
h(z) 
In the case of a pulse-doped MESFET [40] and Np2 >> Np, the calculation yields 


a (Vbi- Vas) 
/i+(4 1) Gaaey 


a] 


(2.13) 


Ip = Ipss | | 


Figure 2.7 compares the transfer characteristics Ip = f(Vcs) for a homogeneously 
doped MESFET and a pulse-doped MESFET with a/d = 1.1, ie. where only 10% 
of the channel is highly doped. The drain current is normalised to the respective [pss, 
which will be different in both cases. Figure 2.7 should not be read suggesting that the 
pulse-doped MESFET has a lower transconductance than the homogeneously doped 
MESFET. 


1.6 r r r 
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Transfer characteristics Ip = f(Vcs) for homogeneously and pulse-doped MESFETs. 
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A striking difference is that Ip = f (Vas) is very linear for the pulse-doped MESFET, 
which is an important advantage from the circuit designer’s point of view. 

All of the above models consider the drain current in the saturation regime to be 
independent of the drain-source voltage. In reality, however, [p depends weakly on Vps 
there. 

The predominant reason for the Jp = f(Vps) behaviour in the saturated regime is 
the scattering of charge carriers into the buffer/substrate layer under the channel. For a 
semi-insulating substrate, where the Fermi level is near mid-gap, the potential barrier 
between the channel and the substrate is approximately Eg/2 and may be overcome 
by electrons with sufficient kinetic energy. These electrons may produce two different 
effects, both of which lead to an Ip = f(Vps) dependence: 


(i) They may lead to a parasitic conduction current through the buffer or the substrate, 
adding to the channel current. 

(ii) They may be captured by crystal faults or impurities in the buffer or the substrate, 
which act as charge traps. The resulting modification of charge below the channel 
influences the channel cross-section, just as a gate electrode would (‘backgating’). 


The latter effect leads to a pronounced frequency dependence of the Ip = f(Vps) 
behaviour. 


Large-signal CAD model 


For circuit design applications, the physical models considered so far are not conve- 
nient. For once, it would be useful to have a model which describes the full range of 
operation in one closed formula. More importantly, the physical design parameters such 
as channel thickness and doping concentration are often not accessible to the circuit 
designer. 

Large-signal CAD models, therefore, are empirical in nature and have extractable 
parameters which can be determined from measurements on the final device. 

An early empirical model which may be used for MESFETs in the saturation regime 
is the “square-law’ JFET model implemented in SPICE: 


Ip(Vos) = B(Vas — Vp)’, (2.14) 
where 
Ipss 
p=—. 
Vp 


It fits the transfer characteristics of the constant-mobility model quite well at Vps = Vx. 
The model can be made to fit to non-parabolic transfer characteristics through a 
modification introduced by Statz et al. [58]: 
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B(Ves — Vp)? 
1+ a(Vas — Vp)” 
The linear region can be elegantly included in a closed form through the use of a 
hyperbolic tangent function, as was first pointed out by Curtice [10]: 


Ip(Ves) = (2.15) 


Ip(Ves, Vos) = Is(Ves) tanh (yv Vps), (2.16) 


where Is(Vgs) is the drain current according to Equation (2.15). 

The dependence of the drain current on the drain-source voltage in the saturation 
regime can be introduced through an additional 1 + AVps term. We arrive finally at the 
common CAD model expression for the MESFET: 

B(Ves — Vp)? 


Ip(Voes, Vi = ————- tanh (y V 1+AVps). 2.17 
p(Vocs. Vos) ib aves= Vo) anh (y Vps)(1 + AVps) (2.17) 


Capacitance model 

We will now leave the quasi-static realm and introduce capacitances. The discussion 
will be restricted first to the intrinsic FET structure, while parasitic capacitances will be 
introduced in context with the small-signal equivalent circuit. 

Assume a MESFET with a homogeneously doped channel region with Vps = 0, 
which implies a constant extension of the gate space charge region, h. The charge on 
the gate electrode counter-balances the charge in the channel. In this case (n-channel) 
the gate charge is positive: 


Qco = @NpWoLo(a — h) = —qNpWoLca (: - ts) ; (2.18) 
Voi — Vp 
using Equation (2.5) and V(z) = 0 due to Vps = 0. 
The gate-channel capacitance for Vps = 0 can now be calculated as the first deriva- 
tive of the gate charge with respect to the gate-channel voltage (which is identical to 
Vas as Vps = 0): 


5Qco Voi — Vp 
Coc = =Co,/ = (2.19) 
5 Vas Voi — Vos 
where 
Np WocLga 
=o 
2(Vbi — Ve) 


For Vps > 0, the Meyer capacitance approach originally developed for MOSFETs [38] 
is often used, which distinguishes between the linear (Vps < V«) and the saturated 
(Vps > Vk) regimes: 


e For Vps < Vk, 
2 Vk — Vos \* 
Cos = -C 1 — {| ———— 
GS 3 GC (=e 7 =) 
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> vj A 
Cop = zac 1- Gea 


e For Vps > Vk, 


2Vi — Vos 


2, 
Cos = raace 


Con=0. 


The intrinsic Cgp = 0 in the saturated regime means that the gate-drain voltage has no 


influence on the channel charge. 


Parasitic circuit elements 


Our discussion so far was restricted to the intrinsic transistor, more precisely to the chan- 
nel region. A realistic transistor model will also have to take extrinsic circuit elements 
into account (see Figure 2.8). The most important ones are: 


(i) The source resistance Rs and the drain resistance Rp. They contain contributions 
from the semiconductor—metal contact at the source, and the semiconductor regions 
between the channel and the source and drain contacts, respectively. The source 
contact is most important, because it has a direct impact on the controlling gate- 
source voltage. Because the gate current can be generally neglected, the intrinsic 
gate-source voltage Vgs is related to the externally applied Vcgs_.¢ as follows: 


Vos = Vas,e — RsIp. 


| O 
Rp 
Cede == 
oO | @ Vos —— Vps,e 
Re Cos,e 
Cose == Yes 
Vas,e v 
Rg 
O O 
Fig. 2.8 Extrinsic circuit elements in the MESFET. The transistor symbol in the shaded box is the 


intrinsic transistor discussed so far. 
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(ii) The gate resistance, which is due to the series resistance of the gate electrode (in x 
direction in Figure 2.3). This can be a problem especially in modern devices with 
very small gate length La, typically <0.25 wm. 

(iii) The parasitic capacitances are Cesc, Cgp,e and Cps,e. They are mostly due to 
the contact and interconnect metallisations within the transistor structure. Cgp.e is 
of particular importance because it is in a feedback path in the frequently used 
common-source transistor configuration where it will give rise to the so-called 
Miller capacitance, and also may lead to amplifier instability. 


Small-signal equivalent circuit 


Introduction: small-signal versus large-signal model 

The physical behaviour of electronic devices is generally non-linear, as has been seen 
above. However, in many cases, we only deal with very small perturbations around a 
given bias point, so that the non-linear functions can be approximated by linear ones, 
dramatically simplifying the calculation effort. 

For example, the non-linear dependence of the drain current on the gate-source 
and drain-source voltages, Ip(Vcs, Vps), can be approximated for small perturbations 
around a bias point (/p,o, Ves.o, Vpos,o) by a two-dimensional Taylor series, which is 
aborted after the linear term: 

bIp bf 


+ vsti. (2.20) 


iq ——— Use 
i OVGs al dVps 


The lower-case symbols ig, vgs and vgs denote small deviations from the bias point: 


ia = Up — Ip,0); vgs = (Vas — Vas.o); vas = (Vos — Vps.0)- 


MESFET small-signal equivalent circuit 
Refer again to Equation (2.20). 
The first partial derivative is the transconductance gm: 


dlp 
Wa |Ves.o.Vps,0 = Sm- 


In saturation, we can use Equation (2.17) to calculate its bias-dependent value: 


oy dVcs | 1+ a(Ves — Vp) 


2(Vas,0 — Vp)[1 + @(Vas,o — Vp)] — «(Vas,0 — Ve)? 
2 
[1 + a(Vas,0 — Ve)] 
__,0(Vas,0 — Vp) + 2(Vas,o — Vp) 


= ; (2.21) 
‘ [1 + @(Voso — Ve)]° 


5 — Vp)? 
SE iat ervosi-+ No) 


~ B 


assuming that AVps < 1, and that when sufficiently in saturation, tanh (yVps) —> 1. 
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The second partial derivative in Equation (2.20) is the output conductance gas: 
dlp 
\Vcs,o.Vps,0 = &ds- 


dVps 
In saturation, assuming again that tanh (y Vps) —> 1: 
B(Vas,0 — Vp)? 


1 + a(Vos,o — Vp) 
= XIp 0, (2.22) 


8ds = 


if we assume also AVps.o < 1. 
In saturation, the bias-dependent intrinsic gate-source capacitance is (see p. 57): 


C 20 Np WoLga 
Gsi = 7Coc = 4 : 
“3s 3 - ./(Voi — Ve)(Vbi — Ves.) 


To this, we have to add the extrinsic gate-source capacitance, so that 


(2.23) 


Cos = Cas,i + Cas,e- 
The gate-drain capacitance has only an extrinsic component (refer again to p. 57): 
Cop = CopD,e- 
Likewise, Cps is purely extrinsic: 
Cos = Cps.e- 


Adding the series resistances Rg, Rs and Rp, we arrive at a first small-signal equivalent 
circuit for the MESFET (see Figure 2.9). 
A more complete small-signal equivalent circuit will add two more elements: 


(i) the resistance Rj which improves the modelling of the non-velocity-saturated part 
of the channel near the source; 
(ii) the domain capacitance Cpc. 


The domain capacitance accounts for a charge dipole forming at the drain end of the 
channel. Provided that Ri < 1/(@wCgs), it can safely be omitted as it is absorbed in 
Cps. 

It is important to note that in Figure 2.10, Vgs drops over Cas only. 


Re p Rp 
Im Vgs 
sO ame N\\—0D 
Vgs | ——— Cgs { 9ds —— Cys 
Y 
Rs 


iS) 


Simple small-signal equivalent circuit of a MESFET. 
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Cop 
Ro Rp 
G oN\W\—_— Kw D 
Vgs Cos Im Vgs 

Coco | 
v 9ds ——— Cos 
WW 

Rs 


S 


Small-signal equivalent circuit of a MESFET including Rj and Cpc. 


G 
Re ik Im Vgs 
AN oe || rf Ti 
ig 4 Vas | —— Cog Dds Cos Ad 
Y v iL 
(a) 
Im Vgs 
a4 Bs 
ig i | —— Cegst Cen v Cos Mia 
VY rw) 


(b) 


(a) Simplified MESFET small-signal equivalent circuit connected for measuring fr. 
(b) Collapsed equivalent circuit due to current source at input and short circuit at output. 


Transit frequency 
The transit frequency of a two-port is defined as the frequency where the magnitude of 
the short-circuit current gain hz; becomes one: 


inti=tol= We ae (2.24) 
g& 


To calculate fy from the small-signal parameters, we refer back to the simple MESFET 
equivalent circuit (Figure 2.9), and further simplify it by omitting the series resistances 
Rs and Rg, which in technical MESFETs are quite small. 

Equation (2.24) can be interpreted as forcing a current ig into the gate terminal, while 
measuring a short-circuit current ig between the source and the drain terminals. 

The appropriate connections are indicated in Figure 2.11(a). Because Rg is in series 
with an ideal current source, it has no effect here and can be omitted. Elements 
ds and Cps are short-circuited and can be omitted also. Cgp is in parallel to Ces. 
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Figure 2.1 1(b) shows the extremely simple equivalent circuit after taking these findings 
into consideration. The current transfer function is now: 


&m 


ig = 8mUgs = ig —————. 
mss "8 j@(Cos + Cop) 


which means that 
&m 


hz\(@) = ————___-. 
= jo(Cgs + Cop) 


The magnitude of 42; becomes unity at 


OT = __Sm_ 
Cos + Cgp’ 
or 
fr = (2.25) 


~ 22 (Cogs + Cop) 


We will now relate the transit frequency to physical parameters. Let us go back to the 
simple velocity-saturated MESFET model (Section 2.2.3). In the simple model, we do 
not use the Meyer capacitance approach, but attribute the full gate-channel capacitance 
Equation (2.19) to the gate-source capacitance. Parasitic capacitances are neglected: 


_ gNpWcLca 
2./(Vbi — Ve)(Voi — Vas) 


For the transconductance, we derive Equation (2.11) with respect to Vgsg and find 


Cos 


= GND UVsat Woa 
2/ (Voi — Ve)(Vbi — Vas) 


&m 


Therefore, 


&m 1 Usat 
=> = * 2.26 
fr 20 Cos 20 Lg ( ) 


The transit frequency can be directly deduced from the carrier transit time through the 
channel. 


Maximum frequency of oscillation 

The maximum frequency of oscillation fmax is a measure of the power gain of a two- 
port (see Section 5.2.4). A common formulation [32] quoted in [36] for finax from the 
small-signal parameters is 


ft 
2./(RG + Ri + Rs) gas + 2 frRGCop 


The expression refers to the equivalent circuit in Figure 2.10, but neglecting Cpc. 

Note the importance of the series resistances, which did not factor into the calculation 
of fr at all. fmax is much more useful to benchmark FETs for power amplification at 
microwave frequencies. 


fmax = (2.27) 
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2.2.6 Noise performance 


When discussing the noise performance of semiconductor devices, we have to distin- 
guish between microwave noise, where the spectral power density of the contributing 
noise sources is frequency-independent (white noise), and low-frequency noise phe- 
nomena, where the spectral power density of the contributing noise sources increases 
with decreasing frequency. 


Microwave noise 

To assess the microwave noise performance of a FET, in principle three different noise 
sources need to be included, each due to the stochastic movement of charge carriers in 
different parts of the device. The simplified equivalent circuit in Figure 2.12 contains 
the MESFET’s main noise sources: 


(i) Areas where mobility is constant, i.e. the region behaves like an ohmic resistor, 
give rise to thermal or Johnson noise. In a realistic MESFET, we need to include 
Johnson noise for the gate resistance Rg and the source resistance Rg. The mean- 
squared value of a Johnson noise source can be expressed as: (lel?) = 8kTRAf, 
where R is the resistance and Af is the measurement bandwidth. Hence, 


(iec) = 8kT RGAf 


(ies?) = 8kT Rs Af. 


(ii) Current flowing across an energy barrier gives rise to shot noise, which is pro- 
portional to the current. Here, a potential gate leakage current flows across the 
gate-channel Schottky diode, resulting in a shot noise component of 


(Iii?) = 8q/aic, 


where Jgic is the DC value of the gate leakage current. 


7 12 
Re (Ligic| ) 


Fig. 2.12 Simplified noise equivalent circuit of a MESFET. 
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(iii) In the channel, the carrier velocity experiences fluctuations due to phonon and 
impurity scattering — this kind of noise is commonly called channel noise, first 
predicted by van der Ziel [62]. According to van der Ziel, 


(iial*) = 8kT gmPAf, 


where P is a fitting parameter equal to 1... 3. 

Note that van der Ziel did not yet include velocity saturation effects. In fact, ana- 
lytic FET noise models are strictly valid only below the onset of saturation. How- 
ever, deviating behaviour in the saturated region and in the presence of velocity 
saturation can be accommodated by a bias dependence in the parameter P [20]. 

(iv) Another effect must be taken into account. Due to the close proximity of the gate 
electrode to the channel, any charge fluctuation in the channel will lead to a phase 
fluctuation with the opposite sign on the gate electrode. This effect is the induced 
gate noise and was again pointed out by van der Ziel [63]: 


(ligI*) = 8KTAf(@Cos)?R/ 8m. 


where R is a fitting parameter, which accommodates different geometries and bias 
points. 


Because of their linked physical origin, (lial?) and (lig|?} are not statistically indepen- 
dent, but show a strong correlation. The correlation coefficient is imaginary (due to the 
capacitive coupling) and strongly bias-dependent. 

The gate leakage noise contribution (li sel) is commonly neglected, because the gate 
diode is reverse biased. Using this assumption, Cappy [6] expressed the minimum noise 
figure as 


Fea= T+ 2\/ P oe 20 V PRL (2.28) 
T 
(Rg + Ro) + PR(— C7?) 
SE eT ap SCAR 


where C is the magnitude of the correlation coefficient. For C = 1, Equation (2.28) is 
equivalent to the famous Fukui equation [21] for the minimum noise figure of FETs: 


Fmin = 1+ lef y 8m(Rg + Rs), (2.29) 
zi 


where kg is a fitting factor. 

Both Equations (2.28) and (2.29) calculate fr using the approximation in Equa- 
tion (2.25). 

The noise equations contain an implicit bias dependence, which cannot be discussed 
in detail. Delagebeaudeuf et al. [12] showed for the bias dependence of parameter P, 


Ip 


a (2.30) 
Ecrit LG&m 


2 van der Ziel discusses this in terms of the channel conductance gao, which is identical to the transconduc- 
tance gm at the very low Vps. 
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which points towards a ./7p dependence for Fyin, at least where gm * const. At very 
low Ip, however, gm also decreases and Fmin increases again. Optimum drain currents 
for low-noise operation are typically at 0.15—0.25/pss. In Equation (2.30), Ecrit is the 
critical electric field for velocity saturation. 


Low-frequency noise 
Low-frequency noise is only discussed briefly here; however, it will be shown that it has 
significant impact on circuit performance, especially in oscillators. 

While there are quantum mechanical reasons for low-frequency noise occurring in 
any conducting or semi-conducting material, practical devices exhibit low-frequency 
noise levels significantly above the quantum limit. This excess noise is due to interaction 
with impurities or dislocations which create energy levels inside of the forbidden gap of 
semiconductor materials. These traps may 


e locally lead to enhanced scattering of charge carriers — mobility fluctuation noise; or 
e modify the number of charge carriers through trapping and release, with a character- 
istic time constant t — number fluctuation noise. 


Even though it was derived at first only for mobility fluctuation noise in bulk semi- 
conductors, the empirical Hooge equation [24] is often applied to low-frequency noise 
parameters. Applied to the drain current Jp, the Hooge relationship finds for the spectral 
power density of the drain current fluctuations: 
2 OH 
Sip = DNF" 
where JN is the number of carriers in a given volume and f is the frequency. Due to the 
observed frequency relationship, low-frequency noise is often coined //f noise. 

This ideal 1/f noise spectrum is frequently superimposed by generation- 
recombination spectra through a trap with distinct capture and re-emission time 
constant t. Such traps lead to noise with a low-pass limited spectral noise power density: 

1 
1+ (Qnf)?t2° 
Figure 2.13 shows qualitatively a low-frequency noise spectrum of the drain current 
in the presence of a distinct trap with generation-recombination noise, a 1/f noise 
component and white noise at higher frequencies. 

The absolute spectral density depends very strongly on the technology. A high 
spectral density at a given frequency is indicative for a high number of defects or 
deep impurities. So it is not surprising that low-frequency noise is much more pro- 
nounced for ion-implanted MESFETs (with significant radiation-induced defects) than 
for epitaxially grown structures. 


(2.31) 


Sn(f) ~ (2.32) 


MESFETs in the third millennium 


Commercially, MESFETs were the transistors of choice for microwave circuits, includ- 
ing monolithic microwave ICs (MMICs), from the 1970s well into the 1990s. Initially, 
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Qualitative low-frequency noise spectrum of the drain current in the presence of 1/f noise, 
generation-recombination noise and white noise. 


—o— fmax(Vds = 1.5V) |--4— 
—#— fmax(Vds = 3V) -3 


H iiiiil ee [E i} PUrtits 0 i i ; j j 
108 109 101° 10! -0.6 -0.4-0.2 0 02 04 06 08 1 
Frequency (Hz) Vgs (V) 
(a) (b) 


(a) |h21| of a2 - 0.12 wm - 75 wm ion-implanted MESFET (Vps = 1.2 V). (b) fmax of a 
2-0.12\1m- 25 wm device in the same technology (H. Hsia, Z. Tang, D. Caruth, D. Becher 
and M. Feng, /EEE Electron Device Letters, Vol. EDL-20, No. 5, pp. 245-247, May 1999. 
©1999 TEEE). 


they only gained importance on GaAs substrates - MESFET structures on other mate- 
rials, including Si, were but an academic curiosity. GaAs MESFETs have been almost 
exclusively replaced in contemporary designs — either by HBTs or HFETs. MOSFETs 
are also making inroads into the former realm of GaAs MESFETs. 

It should be noted that MESFETs did reach cutoff frequencies in excess of 100 GHz, 
even for devices fabricated by ion implantation. Hsia and co-workers [25] reported a 
device with Lg = 0.12\1m, which exhibited fr = 121 GHz and fmax = 160 GHz, 
albeit not at the same bias point or the same device size. Figure 2.14 shows hz; versus 
frequency and fimax versus Vcs for this technology. 
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Gate contact 


Drain contact 


Source contacts grounded to back side through via holes 


Example of a SiC power MESFET (Chip photo adapted from M. Siidow, K. Andersson, 

N. Billstrém, J. Gran, H. Hjelmgren, J. Nilsson, P.-A. Nilsson, J. Stahl, H. Zirath and N. 
Rorsman, JEEE Transactions on Microwave Theory and Techniques, Vol. MTT-54, No. 12, 
pp. 4072-4078, December 2006. ©2006 IEEE). 


Note that the maximum /r is measured at Vos = 0.6 V, i.e. the gate electrode starts 
to be forward-biased. This is also visible from the lower |2;| below 1 GHz. For a more 
practical Ves = OV, fr = 70 GHz. fmax peaks also at Vos = 0.6 V, but is improved by 
a larger Vps, because the latter will further reduce Cyc. 

The record fr and fmax values are measured for different device geometries. This is 
a common trick — fy is measured for a larger gate finger width (here, 75 j1m), because 
Rg does not matter, and the wider finger leads to a better ratio of intrinsic and parasitic 
Cogs. For finax, Rg does matter, and hence a smaller gate finger width is chosen (here 
25 mM). 

The MESFET structure makes a strong comeback on SiC, with important applica- 
tions in power amplifiers, e.g. for mobile radio base stations in the lower GHz range. 
Figure 2.15 shows an example of such a structure [60]. 

Note the multi-finger layout which is very common in power FETs. Due to the limited 
current-carrying ability per unit width (in this case 350mAmm_!), the total device 
periphery needs to be extended. As the series resistance per unit length of the gate stripe 
is quite high in case of submicron gate length (here, Lg = 0.4\1m), Rp can be kept 
small by choosing a short individual gate finger length and connecting transistor cells 
in parallel, in this example for a total gate width Wc = 0.4mm. 

The major advantage of a semiconductor material with a large band gap is the very 
high electric field at breakdown. In this case, the gate-drain breakdown voltage is 180 V. 
The transistor shown produces a saturated output power of 3.1 W at 3 GHz, or 7.8 W/mm 
gate width, when biased at a drain-source voltage Vps = 65 V. The power added 
efficiency in this mode of operation is 70%. 

The device has a small-signal fr = 8 GHz and a maximum frequency of oscilla- 
tion fmax = 20GHz. Record transit frequencies were reported at 28 GHz, and record 
maximum frequencies of oscillation at 50 GHz. 
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High electron mobility transistor 


While the MESFET is conceptually a very simple device, yielding sufficient perfor- 
mance well into the millimetre wave range, it does not unleash the full potential of 
group III-V semiconductor materials. The fact that free charge carriers and ionised 
dopants share the same space in MESFETs leads to a reduction of low-field mobility 
through electrostatic fields, a major effect which we will consider first. 


The importance of Coulomb scattering 


Figure 2.16 shows the electron mobility for nominally undoped GaAs as a function of 
the absolute temperature, along with the two dominant scattering mechanisms. Other 
scattering mechanisms have been omitted for clarity. 

We notice that at room temperature (300 K) the scattering of electrons is mostly due 
to lattice vibrations — longitudinal optical phonons. As we lower the temperature and 
lattice vibrations are increasingly suppressed, another mechanism becomes dominant — 
Coulomb scattering. Coulomb scattering is due to the electrostatic force between the 
mobile charge carriers and the fixed ionised atoms. In doped semiconductors, the main 
source of fixed charge are the ionised doping atoms. Therefore, the main electrostatic 
effect we need to consider in an n-channel MESFET is between the negatively charged 
electrons and the positively charged ionised donors. This is clearly shown in Figure 2.16 
through the strong doping dependence of the mobility. 

From electrostatic theory we know that the force created between two objects with a 
charge of magnitude g — the elementary charge — and the opposite sign of charge is 

q 1 


ae ee ee 2.33 
desde a2 C33) 


Longitudinal optical phonons ™*__ Coulomb 
interaction 
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Electron mobility versus absolute temperature for nominally undoped GaAs, and the underlying 
dominant scattering mechanisms. Data adapted from [59]. 
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With increasing doping concentration, the mobility limiting effect of Coulomb 
scattering will become more pronounced. This is also shown in Figure 2.16. 

Coulomb scattering becomes an increasing problem as we reduce the gate length in 
MESFETs: 


e As we reduce the gate length Lc, we also have to reduce the channel thickness a to 
keep the aspect ratio Lg/a constant.° 

e To compensate for the reduction in a, we need to increase the channel doping 
concentration Np. 

e Then, however, the significance of Coulomb scattering will increase and reduce the 
mobility! 


If the physical co-location of free and fixed charge is the reason for the increased domi- 
nance of Coulomb scattering, then the following idea is immediately apparent: why not 
physically separate free and fixed charge, i.e. the electrons and the ionised donors in an 
n-channel device? 

To find out how this may be done, let us investigate a heterojunction in the 
n* AlGaAs/p~ GaAs material system. The AlGaAs/GaAs material system has the 
advantage that the lattice constant is almost independent of the material composition. 

The band gap in an Al,Ga;_,As/GaAs heterojunction adjusts as follows: 


E,(GaAs) 1.42eV 


AEc(AlGaAs — GaAs) 0.62 AE, for xa) < 0.37 
AE,(AlGaAs — GaAs) 1.255 eV xal as above 


In this example, the Al concentration, doping types and concentrations are: 


Alop.25Gao.75 AS n-type Np = 10!8 cm-3 
GaAs p-type Na = 10° cm-? 


Further, a thin (~5-10nm) layer of undoped AlGaAs is inserted at the hetero- 
junction — this is the spacer layer. Figure 2.17 shows this material combination 
schematically. When discussing heterostructures, the doping type of large-gap mate- 
rials will be denoted with capital letters, while the doping type of narrow-gap materials 
is shown in lower case letters. 

The conduction band diagram of this heterostructure is shown in Figure 2.18. The 
discontinuity at the AlGaAs/GaAs interface, AEc = 0.2 eV, and the potential bar- 
rier towards the p" -GaAs form a triangular quantum well structure, which is the most 
important feature — note how it dips below the Fermi level. Close to the hetero-interface, 
the potential in the GaAs layer can be approximated by a linear function. 


3 Otherwise, the assumption that the electric field is directed predominantly in parallel to the surface will 
break down. Among other things, this would significantly increase the output conductance in the saturated 
regime. 
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AlGaAs/GaAs n*-i-p heterostructure. 
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Ny, GaAs 
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‘A, GaAs 
3 


AlGaAs 


Conduction band diagram of the AlGaAs/GaAs heterostructure. 


Charge control 


If we force the free electrons out of the n-AlGaAs layer, they will congregate in the 
potential well where they are separated from the ionised donor atoms by the undoped 
AlGaAs spacer layer — the sought-after reduction in Coulomb scattering can be achieved 
this way. 

We can ‘force’ the free electrons to leave the AlGaAs when we deplete the doped 
AlGaAs layer (the supply layer) by means of a Schottky contact. The electrons can then 
either tunnel through the spacer layer or overcome the conduction band spike at the 
heterostructure by thermionic emission. 

Note that from now on, it will suffice to draw just the conduction band diagram, 
because we consider electrons only. 

Figure 2.19 represents the band diagram, without applied external voltage, of a high 
electron mobility transistor, or HEMT. By applying a positive gate voltage, the density 
of free electrons in the potential well increases; a negative gate voltage will decrease it. 

An important difference between the MESFET and the HEMT is the current con- 
trol mechanism: in the MESFET, we controlled the thickness of the channel, while the 
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Fig. 2.19 Conduction band diagram of HEMT in thermodynamic equilibrium. 
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Fig. 2.20 Approximation of the HEMT channel region as a triangular quantum well. 


density of charge carriers in the channel remained constant. Here we change the density 
of carriers in the channel, while the thickness of the channel, given by the triangular 
well, remains approximately constant. 

The free carrier ensemble in the channel is called a two-dimensional electron gas 
(2DEG). 

Let us investigate the triangular potential well in more detail. First, we have to be 
aware that the triangular potential well is narrow enough to introduce quantisation of 
energy levels. Consider Figure 2.20. Note that for convenience, y = 0 at the hetero- 
interface. 

We initially assume that the potential walls are infinitely high. In the potential well, 
electrons may only occupy the discrete energy levels E), with/ € [0,1,2,...]. 
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Solution of Schrédinger’s equation yields these energies: 


2 3 2 
h 3 3\ ]3 
Hie (x) Few (1+ 3) (2.34) 


where €y is the y component of the electric field in the well. 
The potential increases linearly beyond the heterostructure. The electric field as the 
gradient of the potential is therefore constant: 


Lal 


Ey = &3 = —dV(y)/dy. 
The discontinuity of the electric field at y = 0 necessitates a sheet charge in this plane, 


whose charge density is 


dV(y) 
dy ~ 


qnsg = €\€s5 = —€1 (2.35) 
where € is the dielectric constant of the semiconductor in the channel region — here 
GaAs. This sheet charge is the 2DEG. 

ng is the sum over the sheet charge densities in the discrete energy levels: ng = 
29 Mm, Where only the first two (J = 0, 1) typically need to be evaluated, because in 
practice the walls of the quantum well have a finite height, set by the conduction band 
discontinuity AEc. 

Using Es = qng/€, we find 


(2.36) 


& 
| 
ee 
g 
whe 
x 
“——_—— 
ete 
rs | 
NW] WwW 
mn |S 
a i) 
a 
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+ 
lw 
oe, 
a | 
WI 
= 
Nur 


and therefore, 
E=y ea (2.37) 
For GaAs, 
yo =2.5x 1072 eV mi 
1 = 3.2 x 1072 eVmi. 


Next, we need to calculate the sheet charge density of the 2DEG as a function of the 
Fermi energy (note that the Fermi energy is referenced to the conduction band minimum 
here). 

Consider the density of states for the two-dimensional electron gas in Figure 2.21. 
The constant D = q m* /(27h") is D = 3.24 x 10!7 m~2 V~! for GaAs. 

The density of the occupied states can be calculated from 


ns = density of states - occupation probability. 
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Density of states in the triangular quantum well. 


2.0 


GaAs 


0.1 0 0.1 0.2 
E,/eV 


Density of the 2DEG in a HEMT structure as a function of the Fermi energy. 


The probability that an allowed state is occupied must be calculated using Fermi— 


Dirac statistics here, because the Fermi energy is inside the conduction band. Only 
two discrete energy levels are considered: 


on dE me dE 
ng = D ————~ + 2D / —S—S—S—S Se (2.38) 
Eo 1+ exp (4578) E\ 1 + exp (45) 
For the integral, we find 
d: 1 : 
i Oe Gah qe, 
1+ exp(a x) a 
and therefore, the sheet charge density is 
: Ep-£, 
ns =DkT Y [+ 1)-In(it+e 7 )]. (2.39) 


1=0 


2 . : 
Because on the other hand EF] = yj ne = this transcendental equation has to be solved 
iteratively. 


Figure 2.22 [64] shows its solution for the case of Si and for the case of GaAs. For 
larger charge carrier densities, ns(Ep) can be approximated by a linear relationship: 
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q®, 


q Ve 


Conduction band diagram of a HEMT structure under gate bias control (Vg # 0). 


= Er — AEFo 
qa 


(2.40) 


ns 


For GaAs, 


A Ex0(300 K) = 0eV 
AEpo(77 K) = 25 meV 
a = 0.125 x 107! Vem’. 


It is this linear relationship which we will use in our future calculations. 

Finally, we need the relationship between ns and the gate-channel voltage Vg. This 
means the potential across supply layer and spacer has to be included in the calculation. 

We consider a structure where the supply layer is homogeneously doped and the 
spacer undoped. Integrating Poisson’s equation twice, we find a parabolic potential in 
the supply layer, and a linear potential in the spacer (see Figure 2.23). 

The built-in voltage drop over the AlGaAs layer V> is 
Y= a Nb 42 — Es(da + di), 

2€2 

with Es = qng/e,. €, is the dielectric constant in the small-bandgap material (here, 
GaAs) and € is the corresponding value in the large-bandgap region (here, AlGaAs). 

For the relationship between Ep and ns, we use the linear approximation, Equa- 
tion (2.40). Solving for ns, 


(2.41) 


€| Np AEF — AEc 
ng= v(¢ d2 + Vg — ). 


q (da a aera 
We introduce a threshold voltage Vorg as the gate-channel voltage where the interface 
carrier density disappears: 


AEpo— AEc @gNp 


Vort = Pp + ae (2.42) 
qd 2€2 
For simplification, we define a virtual increase of the supply layer thickness: 
€2a 
Ad = —. (2.43) 
q 
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We can now write Equation (2.41) in a more compact form: 
Ve —V. 
pee ic (2.44) 
q dat+d, + Ad 
The threshold voltage can be controlled via the thickness of the doped layer. Calculate 
the supply layer thickness where Vorr = 0: 


2 (« FO ©), 
Np q 


dao = da(Vorr = 9) = / (2.45) 


If now: 


dg > dag: The HEMT is normally on or operating in “depletion-mode’ — it will pass 
drain current for Ves = 0. 

da < dao: The HEMT is normally off or operating in “enhancement-mode’ — it will 
not pass drain current for Ves = 0. 


The threshold voltage in practical HEMTs is often tailored for the maximum transcon- 
ductance to occur for Veg = 0, which implies Vor < 0, as we will see further 
down. 

Gate-channel capacitance. The gate-channel capacitance can be easily calculated by 
differentiating the charge in the 2DEG with respect to Vo: 


dns WoleG 
Co=q WoL = : 2.46 
OE NO"S Wa? Gat a+ Ad vem 


for Vg > Voge. 
For Vg < Voge, the 2DEG will be depleted, and in first-order approximation, the 
gate-channel capacitance disappears. 


A practical HEMT example 
Before we continue to consider the channel current as a function of gate-source and 
drain-source voltages, let us briefly look at a practical transistor structure. 

The structure shown in Figure 2.24 is the classic cross-section of a HEMT. The 
source and drain contacts are non-rectifying (‘Ohmic’) contacts. To facilitate a low con- 
tact resistance, they sit on highly n-doped GaAs. AlGaAs habitually forms aluminium 


Gate : n*-GaAs contact layer 


Source 


MW~YFZN 


: 


n-AlGaAs supply layer 


YY IJIII]7~77 


—— 


AlGaAs spacer 
(nominally undoped) 


Y 


5-10nm 


p -GaAs buffer 
Zz 
Semi-insulating GaAs substrate 


Classic AlGaAs/GaAs HEMT structure. 
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oxide at the surface which would seriously increase the contact resistance. The alloying 
process will drive the contacts down through the heterostructure to make contact with 
the 2DEG. This is indicated by the shaded regions in Figure 2.24. 

The gate contact must be a Schottky contact as shown. It sits in a recess through the 
GaAs contact layer. The recess depth controls the gate-channel separation (dg + d;) and 
is an important technological parameter. 

The n-doped AlGaAs layer is called supply layer because its doping atoms supply 
the free carriers in the channel. 

The thickness of the spacer layer (typically 5-10 nm) controls not only the reduction 
of Coulomb scattering, but also the transfer of electrons from the supply layer into 
the channel. It must be carefully optimised. The AlGaAs in the spacer is not actually 
intrinsic — it is just not intentionally doped. 

The GaAs layer should be low doped, but it must be p-type. The free carriers in the 
channel must come from the supply layers and not from the GaAs buffer, otherwise the 
HEMT cannot be shut off under gate control — it exhibits parallel conduction. According 
to the mass action law, 


2 
1 
n =. ——— 
Pp 
Na 
As in GaAs the intrinsic carrier concentration is* ni = 2.1- 10° cm7?, even a very 


low acceptor doping concentration, e.g. Na = 10!° cm™?, will virtually eliminate free 
electrons in the p-buffer. 

The AlGaAs/GaAs heterostructure was first grown and analysed by R. Dingle at Bell 
Laboratories in 1974. For a review of early work on AlGaAs/GaAs heterostructures, 
refer to [13]. Mimura [39] was the first to practically realise a HEMT. 


Channel current — constant mobility 
So far, we only considered the case of Vps = 0. Now, we will allow Vps > 0, i.e. a 
current will flow between source and drain. This current is 


Ip = qns(Z) Up (Ez) Wo = const, 


due to current continuity in the channel. Because of the voltage drop along the chan- 
nel between a point z and a source V(z), the density of the 2DEG now becomes 


z-dependent: 
€1 


ng(z) = (dat d+ Adqe'ss Votre — V(z)]. 


Figure 2.25 shows the immediate channel region and the appropriate voltages 
affecting the channel. 
As in the MESFET, we assume that the channel is 


e one-dimensional, i.e. the electric field € has only a component in z direction (€,); 
e gradual, i.e. the carrier densities change so slowly that diffusion currents can be 
neglected. 


4+ www.ioffe.rssi.ru/S VA/NSM/Semicond/GaAs/bandstr.html 
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se 


HEMT channel region. 


As in the MESFET, we will first consider the low-field case where [zy = const. 


Ip = qns(z) Wo Mn Ez(Z) 


€1 Wo dV (z) 
= Yi V. V : 
Pee rer GS off (Z)] Mn ee 


Obeying current continuity, we find 


(2.47) 


€1 Un Wo 
(da +d, + Ad) Le 


Let f be the transconductance parameter: 


dV(z) dz 


— 
Pp dz 


Le 
[ [Vos — Vort — V(z)] 


Wi 
ee ee (2.48) 
(da +d, + Ad) Le 


Then, using parameter substitution to integrate over V instead of z: 


V(La) 
Ip=B i [Vos — Vort — V(z)] dV. 
V(0) 


Note that V(LG) = Vps, V(O) = 0. 
Hence, we obtain the current—voltage characteristics in the linear regime (small Vps): 


v2 
Ip = B | (Vas — Vost) Vos — = ; (2.49) 


For very small Vps « 2(Vcs — Voge), we note Ip © B Vps (Vas — Vor). In this regime, 
the HEMT acts as a ‘voltage-controlled resistor’. The parameters £ and Vogr can be eas- 
ily extracted if Vps = const. This is shown in Figure 2.26. The drain current is measured 
for two Ves while keeping Vps = const « 2(Ves — Vor). Linear extrapolation towards 
small Vggs provides Vosr at the intersection with the Vgs axis. Once Vos is known, the 
transconductance parameter can be calculated as 


Ip(Ves = 0) 
Vos Vott 


p=- 
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Extraction of Voge and f at small Vps. 


Channel current — constant velocity 

The ansatz in Equation (2.47) assumes that ns(z) > 0 for all 0 < z < Lg. If Vps is 
sufficiently large, the gate-channel voltage may drop below Vogr in the channel and the 
channel will become fully depleted. As V(z) < Vps, this happens first at the drain end: 
Vas — Vos = Vogt or 


Vos = Vk = Vas — Vor. (2.50) 


This corresponds to what we saw in the MESFET, where the undepleted channel height 
disappeared at the drain end. 

As in the MESFET, we can argue that velocity saturation prevents channel closure — 
nsg(z) — O implies €,(z) — oo due to the current continuity requirement, so that the 
constant-mobility assumption breaks down much earlier and vp(z) > Usat. 

Using a two-region model for the electron velocity, where mobility is constant for 
|Ez| < Ecrit and velocity is constant for |Ez| > Ecrit, we can calculate the drain-source 
voltage Vpss for which velocity saturation happens at the drain end of the channel 
(z = Lg). At this point the local electric field is €,(z = Lg) = Ecrit. Using the constant- 
mobility current Equation (2.49), we find 


2 


€1LnWe Voss 
— Vi Vore) VI oe 
D Loldy +d, + Ad) [ GS oft) Vpss 5) 


On the other hand, /p can be calculated using a constant-velocity ansatz: 
Ip = q ng WG Usat. 


In the two-region model, WpEcrit = Vsat- Further, ng can be calculated from Equa- 
tion (2.44) using Vg = Vos — Vpss, so that 


— «1 WE bn 
(dq + d, + Ad) 


Ip (Vos — Votr — Voss) Ecrit Lc. 


The drain current expressions for constant mobility and constant velocity must be 
equal for Vps = Vpss, because we are transitioning from the constant mobility to 
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Sample calculation of [p following Equation (2.52). Parameters are 8 = 83S (Vm)~!, 
Ecrit = 1kV cm—! and Lg = 0.2m. 


the constant-velocity regime. This leads to a quadratic equation in Vpss, which we can 
solve to find the necessary drain-source current for velocity saturation to set in: 


Vag = ¥, Ves = Vie" 
Vee Vel ie a i el 2) (2.51) 
Vo Vo 


where Vo = Ecrit La. 
For the corresponding drain current in the velocity-saturated case, we finally find 


Vos — Vor \7 
Inss = B V2 eas = till, (2.52) 


In Figure 2.27, a sample calculation has been performed using Equation (2.52). Note 
that for the most part, /p is a strictly linear function of Vgs. Very close to Vogr, a more 
parabolic behaviour dominates. 


Small-signal parameters 


Transconductance in the saturated regime is calculated by differentiating Equa- 

tion (2.52) with respect to Vgs: 

_ dIpss _ _ B (Vas — Vortt) 
dVcs - (a 


Vo 


(2.53) 


&m 


For large Vgs, transconductance is predicted to be constant, while it is approximately 
linearly dependent on Vgs for small Vgs. We will see later that this ideal behaviour is 
superseded by parasitic effects, however. 
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The gate-source capacitance Cgs can be found by differentiating the total channel 
charge Qy with respect to Ves: 


d d EG 
a Wo / ng (z)dz. 
0 


Cc = = 
alee Ves  dVcs 


Particularly simple — and practically important — is the case of velocity saturation in the 
whole channel. As 


Ip = Gns(Z) Vsat WG = const > ng(z) = const = ngs, 


v2 Vos — Vor \” 
Pe BV5 ee “t) =i 
d Vsat Wo Vo 


where 


and therefore 


VEL Vas — Vor \” 
japeeiie = C= | 


Usat 


The gate-source capacitance becomes in this case 


dQr 

dVcs 

Lg B Vo (Ves — Vorr) 
Ysat /(Vas — Vor) + V3 
Le 


Usat 


(2.54) 


Cos = 


As in case of the MESFET, we find for the transit time of carriers under the gate: 


La _ Cas 
T= — = —. 
Usat &m 


The gate-drain capacitance can be calculated similarly: 


in this simple model because Or 4 f(Vocp). In reality, Cgp is non-zero because of the 
geometric capacitance between the metal contacts and other parasitic effects. As in the 
MESFET, Cop < Cgs in saturation. 

For the small-signal equivalent circuit, we can use the same topology as for the 
MESFET. Accordingly, the transit frequency can be approximated by 


&m 
= : 2.55 
fT In Cos (2,55) 
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‘High electron mobility’? 


The common name ‘high electron mobility transistor’ deserves some critical reflection. 
Remember that while Coulomb scattering is the dominant mobility-limiting mecha- 
nism at cryogenic temperatures, phonon scattering is dominant at room temperature 
(Figure 2.16). 

Realistic enhancement factors for the electron mobility in HEMTs are 


e a factor of two at room temperature; 
e up to a factor of 100 at cryogenic temperatures (e.g. 77 K — liquid nitrogen). 


Furthermore, we found that in short-channel FETs velocity saturation dominates in the 
channel — hence the enhancement in mobility has two major advantages: 


(i) reduction of series resistances, most importantly Rs; 
(ii) lowering of the critical field for velocity saturation, i.e. saturated velocity will be 
reached sooner. 


However, there are other advantages of the HEMT structure which are also significant: 


e The carrier distribution is similar to that of a pulse-doped MESFET — we expect a 
constant transconductance gy and therefore a high linearity. There are parasitic effects 
which prevent this from happening — more about this later. 

e In active operation, the supply layer is fully depleted and hence the gate-source 
capacitance Cgs should be constant. Again, this is not quite true in reality (see p. 81). 

e The potential barrier towards the substrate reduces carrier injection into the substrate 
and increases output resistance. 


Note that, compared to a MESFET with an epitaxially grown channel, the HEMT 
structure is technologically not much more complicated. 

As an aside, the HEMT has many other names and has jokingly been called a multi- 
acronym device (MAD). To name but a few: 


HFET heterostructure field effect transistor 
MODFET  modulation-doped field effect transistor 
TEGFET two-dimensional electron gas FET 
SDFET selectively doped field effect transistor 


Non-ideal behaviour 


In the following pages, we will discuss how in practical HEMTs the experimentally 
observed behaviour deviates from the theory developed so far. The explanation of these 
non-ideal features will have important implications for the design of optimised HEMT 
devices. 
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Non-ideal HEMT behaviour for large Vgs 

From simple HEMT theory as outlined above, we expect that for sufficiently large Ves — 
Vorr, the drain current increases linearly with Vcgs and hence the transconductance is 
constant. Also, we would expect that the gate-source capacitance is constant in the same 
region. 

Experimentally, however, transconductance and gate-source capacitance show the 
behaviour in Figure 2.28 [1]: after a sharp increase above the threshold voltage, the 
transconductance goes through a maximum, then decreases again for higher Ves. The 
gate-source capacitance initially tracks the transconductance, as predicted by Equa- 
tion (2.54) (save for a constant parasitic contribution), but then increases further for 
higher Ves. 

This compression of the transconductance is due to a ‘parasitic MESFET? effect [33]. 
To understand its origin, please consider Figure 2.29. 
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Experimental transconductance and gate-source capacitance versus gate-source voltage. 
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Conduction band diagram of a HEMT under high Vgg. The arrows indicate the locations of the 
2DEG and three-dimensional electron gas (3DEG). 
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| 


! Ns 3DEG 


Electron sheet density 


Gate-source voltage Vgs 


Dependence of the electron sheet densities in the 2DEG and 3DEG as a function of the 
gate-source voltage. 


If Ves is sufficiently high, the conduction band minimum in the AlGaAs supply layer 
dips below the Fermi level. At this point, the free electron density in the supply layer 
will rapidly increase, the supply layer is no longer depleted. Because the free electron 
population in the AlGaAs conduction band minimum has very little confinement, it is 
referred to as the three-dimensional electron gas or 3DEG. In its low confinement, this 
channel is very similar to a MESFET’s, hence the term parasitic MESFET. 

Once the 3DEG builds up, it electrostatically shields the 2DEG from the gate elec- 
trode — the 2DEG density ng 2p saturates; any further increase in charge density due 
to a further increase in Veg will benefit only ng 3p. This is schematically shown in 
Figure 2.30. 

The rise of the 3DEG has two substantial effects: 


e Because the mobility is much lower in the ternary AlGaAs supply layer than in the 
GaAs channel region, the resulting transconductance due to the 3DEG channel is 
lower, causing the overall transconductance to decrease. 

e The additional charge under the gate leads to a strong increase in the gate-source 
capacitance. 


Recall Equation (2.55) — the simultaneous decrease in transconductance and increase in 
gate-source capacitance will have a very negative impact on the transit frequency fr. 
Using the data from Figure 2.28, this is exemplified in Figure 2.31. 

The transit frequency, which in our simple theory was predicted to be independent 
of frequency, now shows a pronounced maximum, which occurs for gate-source volt- 
ages slightly lower than the transconductance maximum. For the design of high-speed 
circuits, this is an important observation. 

The ungated FET structure can be used as a model for the situation at the onset of the 
parasitic MESFET effect, where ns 3p = Np. Figure 2.32 shows the conduction band 
diagram. 
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Gate-source voltage Vgs/V 


Calculated transit frequency of the HEMT in Figure 2.28 versus gate-source voltage. 


Conduction band diagram of an ungated HEMT structure. 


We find that 


N ‘ 
Fa= ABS =GVoy= kT ln cv = AEc — qVp, 


D 


for large Np. Vp is the built-in potential in the large-band-gap part of the heterostruc- 
ture. Using Equation (2.40), 


DN 


NS,max ~ 


AEc — AEgo — gVp2 
qa , 


To maximise “5 max, we must therefore choose a material combination with large AEc. 
In a conventional HEMT structure, ns max © | - 10!2 cm7?. 


Trapping effects 
In an Al,xGa;_x,As/GaAs HEMT, we may obtain a larger AEc by increasing the Al 
mole fraction x. However, consider the following detrimental effects. 

For xaj > 0.3, the effective energy depth of the donor level increases — the number of 
free carriers provided by a given doping density Np will decrease. 

Earlier, for xa; > 0.25, the density of deep traps (DX centres) will increase. These 
traps are energy states in the forbidden gap which can interact with the valence or con- 
duction band. In this case, they are closer to the conduction band, at an energetic depth 
Ey — they are ‘donor-like’. The ‘X’ denotes that their physical origin was long unknown. 
A trap will capture a free electron from the conduction band and eventually re-emit it. 
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The characteristic time constant for re-emission is strongly temperature-dependent: 


aaa 


ea (2.56) 


TrRE(T) = To exp ( 
where Eg is an additional energy barrier for re-emission. In AlGaAs, Ey ~ 50meV 
and Ep ~ 300 meV. 

When reducing the temperature, formerly free carriers will be ‘frozen’ and as a con- 
sequence, will no longer be available for the channel. This can be described by a shift 
in threshold voltage: 


q 
AVott = —5-Npr.ion a. 


The density of ionised traps is, using Fermi—Dirac statistics, 
Npt 
Ep—Epy : 
1+ exp (47) 


Note that the trap density Npr has been experimentally observed to be proportional to 
the donor density Np. It is now accepted that the donor atoms themselves introduce 
two different energy levels in the forbidden gap in AlGaAs: a shallow one associated 
with the ’-minimum (the direct minimum) — this is the proper donor level — and a deep 
energy state associated with the L-minimum (an indirect minimum) — this is the DX 
centre [5]. 

The effect of DX centres in the supply layer made early HEMT structures very 
problematic in cryogenic operation. 


Npt,ion = 


30 


20 


Ip(300K)/mA 
Ip(77K)/mA 


10 


0 


0 1 2 3 


Output I-V characteristics of a HEMT device with a low-temperature current collapse 
phenomenon (A. Belache, A. Vanoverschelde, G. Salmer and M. Wolny, JEEE Transactions on 
Electron Devices, Vol. ED-38, No.1, pp. 3-13, January 1991. ©1991 IEEE). 
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Figure 2.33 shows an example of a device showing such a current collapse phe- 
nomenon [3]. Apart from the change in output conductance in saturation and the various 
‘kink’ effects, which shall not be discussed here, we note 


e At low Vps, the output conductance in the linear regime decreases considerably — this 
is due to an increase in the source resistance Rs. The decrease in Rs with decreasing 
T is unexpected, as the mobility itself will increase. The decrease in the free carrier 
concentration, however, dominates. 

e In the saturated regime, Vps > 0.5 V, the transconductance also decreases signifi- 
cantly with decreasing temperature. 


The occurrence of DX centres is closely linked to the use of AlGaAs as the barrier 
material — other supply layer materials such as GaInP do not show this effect and will 
correspondingly fare better in their low-temperature performance [7]. 


Structural HEMT variations 


Increasingly, structural variations of the original HEMT concept are being used to cir- 
cumvent the non-ideal behavioural effects explained above and to improve performance. 


Pulse-doped HEMT structure 
Due to the severeness of the DX centre limitation, a method to eliminate this limitation 
has the highest priority. 

Because Npt ~ Np, the trap-induced threshold voltage shift is 


AVott ~ Np dj. 
On the other hand, the supply layer must be able to supply the necessary carrier density 
in the 2DEG: 


Np da > ns. 


If, therefore, we concentrate the doping in a narrow sheet — increase Np and decrease 
dq — the trap-induced threshold voltage shift can be drastically reduced. 

This concept leads to the delta-doped (or pulse-doped) HEMT structure (see 
Figure 2.34). 


Gate 


Source Drain 
ee barrier 


dy n*-doped supply layer |AlGaAs 
T ndened supply layer] 


ef fe 


Semi-insulating GaAs substrate 


Layer structure of a delta-doped HEMT. 
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Conduction band diagram of a pulse-doped HEMT structure. 


The restriction of doping to only a narrow sheet of the wide-gap layer, of course, also 
modifies the band structure. The Poisson equation 
d*V p 


dy2€ 
tells us that the potential V will have a linear y dependence if p ~ 0. Where p 4 
0 = const, V will have a parabolic dependence on y. These principles are visible in the 
conduction band diagram of a pulse-doped HEMT structure (see Figure 2.35). 
As an additional advantage, the pulse-doped HEMT can be expected to have lower 
gate leakage because of the lower doping of the region immediately under the gate. 


Pseudomorphic HEMT structure 

So far, the AlGaAs/GaAs HEMT structures considered were lattice-matched — a signif- 
icant advantage of the (Al,Ga)As material system is that the lattice constant is almost 
independent of the Al content. 

We will now deliberately leave the lattice match principle behind and allow for 
material combinations which are lattice-mismatched, but where the lattice difference is 
accommodated by elastic deformation of the crystal — pseudomorphic structures. This 
gives us greater flexibility in the choice of materials. 

Let us replace the GaAs channel in a conventional HEMT with an InGaAs chan- 
nel. This leads to a double-heterostructure because the GaAs buffer and substrate shall 
be maintained. In Figure 2.36, the conduction band diagram of an example structure 
combining the pseudomorphic channel layer with a pulse-doped barrier is shown. 

Compared to the conventional HEMT, this structure has several advantages: 


The significantly higher conduction band discontinuity increases the maximum den- 
sity of the 2DEG from about 1-10!* m7? for the conventional HEMT to about 
2-10!* cm? for the pseudomorphic HEMT as shown. 

The low Al content in the supply layer reduces the density of DX centres. 
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Ino 15Gap g5As 


Alo.15Gaq.gsAs GaAs 


Conduction band diagram of a pseudomorphic HEMT structure with a pulse-doped barrier. 


Ss | D n*-InGaAs contact layer 
InAIAs barrier (higher AEc¢) 
Pulse INo.53Ga9,47As quantum well 


doping 


InGaAs/InAlAs graded superlattice 


Semi-insulating GaAs substrate 


Layer structure of a metamorphic HEMT. 


e The addition of In in the channel enhances the low-field mobility and, to a lesser 
extent, the peak velocity in the channel. 

e The added heterostructure towards the GaAs buffer reduces injection of carriers into 
the buffer and substrate. 


Higher A E¢ are possible with higher In concentrations in the channel. However, note 
that with increasing In content, the InGaAs layer thickness must be reduced. In practi- 
cal pseudomorphic HEMTs on GaAs substrates, xj, = 0.15...0.25. This increases the 


maximum density of the 2DEG to ns max & 2° 10!2 cm7?. 


Metamorphic HEMT 
Mobility in the channel would benefit from even higher In mole fractions, e.g. xj, = 
0.53, as in InGaAs lattice matched to InP. However, InP substrates are still considerably 
more expensive than GaAs wafers. 

The metamorphic HEMT concept enables high In mole fractions in the channel 
layer on GaAs substrates, through a modification of the lattice constant in a graded 
superlattice. Such a layer structure is shown in Figure 2.37. 
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An InAlAs/InGaAs superlattice with varied thickness and composition is grown on 
top of the GaAs substrate such that the effective lattice constant (modified by com- 
position and built-in mechanical strain) is adapted from that of GaAs to (in this case) 
the one of InGaAs with an In mole fraction of 0.53. The use of a low-temperature 
grown superlattice allows the change of the lattice constant while keeping the density 
of deformation-related crystal defects low. 

As in all modern HEMTs, the barrier layer is assumed to be pulse-doped. Another 
modification is the use of InAlAs instead of AlGaAs as barrier material. InAlAs has a 
higher conduction band discontinuity towards InGaAs than AlGaAs for comparable Al 
mole fractions; furthermore, the InAlAs/InGaAs heterostructure stack is easier to grow. 
As contact layer material, InGaAs is used here because it has a much lower Schottky 
barrier height than GaAs. 


CAD modelling of HEMTs 


Due to the similarity of the HEMT to MESFETs and (as we will see in the next section) 
MOSFETs, CAD models of these two devices are often re-used to simulate HEMTs. 

In the discussion of CAD modelling, we will go beyond the rather simple models 
provided for the MESFET and describe a high-accuracy semi-empirical approach. It is 
equally suitable for an enhanced precision model of the MESFET. 


Static current equations 

A HEMT-specific problem is the simulation of transconductance suppression at large 
Vas (see p. 81). A suitable drain current expression which accommodates this (the 
discussion follows I. Kallfass [27]) is 


Ips(Vas) = B (Vas — Vorr)*/+8 Yas Vort)) (2.57) 


in saturation — neglecting the Vps dependence of Ips. 
The non-saturated region at small Vps can be included using the tanh term already 
discussed in the context of the MESFET Curtice model: 


Ips = B (Vos — Vore)*/At§ Vas— Yor) tanh (a Vps). (2.58) 


In real devices, the drain-source voltage has a non-linear influence (so far neglected) 
on the current in saturation, e.g. through impact ionisation effects. In the non-saturated 
regime, on the other hand, the tanh (a Vps) expression is not always sufficient, because 
the Ves dependence is not adequately modelled. An effective voltage Veg is introduced, 
replacing the simple Ves — Vorr term in Equation (2.58): 

a 


1+nVpo +€ Vert 


Ips = B Vere tanh [a Vps (1 + ¢ Verr)] (2.59) 
1 7 3 

Vert = 5 Vast + \/ Vs, + 6 

Vast = Vas — (1 + B2) Vio + v Vos. 
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This expression, introduced by Cojocaru and Brazil in 1997 [8], is called the COBRA 
current equation. Its advantage is that it is continuous in the entire bias plane, and also 
its derivatives are continuous, which is very important for simulations of the non-linear 
behaviour of circuits. 

B,A, “,&, a, €,5, y and Vro are model parameters to be extracted by measurements. 
B; is equal to 6, but dimensionless. They affect [ps as follows: 


a, ¢ affect the linear regime of the device — aw is the main parameter modelling the 
Vps dependence; ¢ modifies the Vgs-dependent behaviour. 

B is the main transconductance parameter. 

€ is the parameter which adjusts the transconductance compression. 

y introduces a Vps dependence to the drain current in the saturated regime and is 
hence responsible for the output conductance. 

ju equally introduces a Vps dependence in the linear regime. It is used to model 
impact ionisation effects in the saturated regime. 

4 adjusts the curvature of Ips(Vcs) for small Vps and close to threshold. 

Vro is the threshold voltage for small Vps. 


The drain current source Ips = f(Vags, Vps) is embedded into an equivalent circuit to 
account for the series resistances and the non-linear gate-source and gate-drain contacts. 
This is shown in Figure 2.38. Note that the controlling voltages drop between the inter- 
nal nodes! The diodes, Dgs and Dgp, are used to model the non-linear gate current. 
Breakdown behaviour can equally be included here: 


Vas 
Ios (Vos) = Isgs («x = i) 


nia Vr 
Vas — Vbv \ Vas 
+ Ipy ex 2.60 
ae ( Nby Vr Vbv aot 
Vap 
Iop(Vep) = Isga | €XP eal 
nia Vr 
Vop — Vi Vi 
+ Iby exp ( = ~) ae (2.61) 
Npy Vr Vov 
Rg Pep Rp 
GOV Po WW\-0 
Des lps =f Ves, Vos) 
Ves Vos? 
Rg 
iS) 
Fig. 2.38 Quasi-static equivalent circuit used in the COBRA model. 
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where /,9, and /sgq are saturation currents for the gate-source and gate-drain diodes, 
respectively, and nig is the emission factor for these diodes. The second term in each 
equation models breakdown with an exponential diode term. Vpy is the breakdown volt- 
age and Jpy and mpy are used to model the current increase beyond breakdown. The very 
last product term simply makes sure that the breakdown current is zero, if either Ves or 
Vep are zero in Equations (2.60) or (2.61), respectively, but has no other major effect. 


Non-linear capacitance equations 
To properly model the non-linear behaviour in any FET, we need to account for several 
contributions: 


e parasitic (non-bias-dependent) capacitance, 
e the junction capacitance, 
e the change in channel charge with varying voltage. 


The first two are straightforward to model: the parasitic capacitance is Cpgs for the 
gate-source diode and Cpygq for the gate-drain diode. For the junction capacitance, the 
common form also implemented in SPICE is used: 


Co 


lo 
(1- %) 


where Co is the capacitance without any external voltage, Vp; is the built-in voltage of 
the junction and m is an exponent. 

Inclusion of the channel charge is much more complicated. For once, the channel 
charge depends on Vgs and Vcgp simultaneously. Then, charge conservation needs to be 
satisfied. This means [28] 


8Cos _ 8° Oc 8°Qc__ 8Cop 

d6Vcs  5VcsdéVGp  SVGpdVcs  dVcs_ 
Any empirical expressions for Cgs(Vgs, Vop) or Cgp(Vcs, Vap) must fulfil Equa- 
tion (2.62). 

Figure 2.39 shows gate-source and gate-drain capacitances experimentally deter- 
mined from S-parameter measurements, as a function of Vgs, for Vps values in the 
linear and the saturated regime of FET operation. Note the rather strong variation near 
pinch-off, and generally in the linear regime. 

In the following empirical equations [29], the tanh(x) function is again exploited, 
similar to the Curtice models. 


C(V) = 


(2.62) 


Ces(Ves, Ved) = Cpgs + 


+ Cgso {1 + tanh[« (Ves — Vi2)]} 
+ Cs(Vgs) {1 + tanh[u(Ves — Vep — Via) I} 


5Cs(Ves) ( 
VG 
dVcs 


1 
, In {cosh[t(Vgs — Vep — van) (2.63) 


More free ebooks : http://fast-file.blogspot.com 


Fig. 2.39 


2.3.8 


Electronic devices 91 


50 
Cos Saturated 
regime 
40+ 
30+ i 
" \\ Linear 
5 regime 
20 - Cop | leet 
a ee cc molarelee 
regime 
glu ; 
1.2 0.6 0 0.6 
Ves/V 


Experimentally determined Cgs and CgGp of pseudomorphic GaAs HEMT (Lg = 0.15 wm, 
Wo =2 x 20um). 


Cedi 


(8 
+ Coa2 {1 + tanh[« (Vep — Vis) I} 
— Cs(Ves) {1 + tanh[u(Vgs — Vep — Via) I} - (2.64) 


Cas(Vas, Vad) = Cpga + 


The capacitance 


Cs(Vas) = C3 ‘aA 


with 


1 
Vert = a (vas — Va t+ ¥ (Vos — Vis)? + 6?) 


is closely related to the drain saturation current (compare Equation (2.59)). 

Cgsi, Cgs2, Cgai, Cea2,m, Voi, Vi2, Via, Via, Vis, ¢, «, @ and yw are fitting parameters. 
The non-linear capacitances, along with an additional parasitic drain-source capaci- 

tance Cps and a parasitic channel resistance Rj, have been combined in Figure 2.40 to 

form a basic non-linear dynamic model of the HEMT. More bias-independent parasitic 

parameters may be added, as needed. 


MESFET versus HEMT: a small-signal comparison 


When the non-linear equivalent circuit in Figure 2.40 is linearised in a given bias point, 
the resulting small-signal equivalent circuit is identical to that derived for the MES- 
FET in the previous section, with the exception of the domain capacitance Cpc, which 
is often neglected anyhow. Many results obtained for the MESFET can therefore be 
directly applied. Rather than repeating the results here, let us discuss how the achievable 
small-signal performance differs between MESFET and HEMT. 
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A dynamic non-linear model of the HEMT. 


As discussed, the higher low-field mobility affects the series resistances which rep- 
resent semiconductor regions outside of the velocity-saturated channel. These are Rs, 
R; and, of lesser importance, Rp. Equally, it increases the transconductance gm (see 
Equation (2.53)). 

The larger potential barrier between the channel and the substrate reduces the output 
conductance gqs in the HEMT. 

These findings directly translate into a significant advantage in terms of the maximum 
frequency of oscillation, fmax: 


ft 


fi ax = : 
me 26/(Ra + Rs + Rigas + 27 ft RGCpG 


The gate resistance Rg is, of course, independent of the device structure. 
The HEMT structure also has a positive impact on the noise performance. This can 
be shown using the Fukui equation already introduced for the MESFET: 


Fmin = 1+ ey &m(Rc + Rs) 


ft 
R R 
Sigenpeeg 
&m 
using 
&m 
fr 21 Ces , 


The noise performance is improved not only by the reduction in Rs and the increase in 
8m. The fitting factor kp, which is typically 2.5 in MESFETs, decreases to kp = 1...2 in 
HEMTs. This is commonly explained by the higher correlation between channel noise 
and induced gate noise, and the reduction in channel noise due to the smaller degree of 
freedom of carrier movement in the 2DEG. 
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A practical HEMT example 


The example chosen here is a metamorphic HEMT [4] because it illustrates many of the 
concepts discussed. 

The device structure is shown in Figure 2.41. The rather thick (1 jum) linearly graded 
buffer adapts the lattice constant of the GaAs substrate to the much larger lattice con- 
stant of Ino.53Gao.47As and Alo.4gIno.52As (the ternary compounds are lattice-matched 
to each other). Note the ‘double doping’ structure — there are 6-doped AlInAs layers 
below and above the InGaAs quantum well. In this case, it allows a density of the 2DEG 
of Ns max = 4° 10! cm?, together with the excellent carrier confinement in the quan- 
tum well. The excellent confinement is due to the large conduction band discontinuity 
between Ino.53Gag,47As and Alg.4gIng.52As. 

The ohmic contacts are placed on an Ing,53Gao,.47As cap layer, which reduces the con- 
tact resistance and shields the metal—semiconductor interface from the Al-containing 
alloy, which is prone to formation of Al oxides at the exposed surface. The gate contact 
has a T shape which reduces the series resistance of the gate stripe and hence Rg. It is 
again shown in Figure 2.42. 

The cross-section of the gate metallisation is significantly larger than what would 
be possible for a simple stripe with a 250 nm footprint, due to the T-gate structure. 
A refractory metal is used here so that the gate can be fabricated before the ohmic 
contacts — this allows an easy self-alignment of the ohmic contacts with respect to the 
T-gate structure, minimising the distance between the source and drain contacts and the 
channel, reducing Rs and Rp. The surface between the gate and the ohmic contacts is 
passivated by a SiN layer. 

The plot in Figure 2.43 shows the drain current and transconductance of the device, 
normalised to | mm gate width, at Vps = 1 V, which is well into the saturated regime 
for this device. The actual gate width of the characterised device is 201m. The gm 
maximum is placed at Vggs = 0 — this is frequently done as it facilitates gate biasing. 
The threshold voltage is slightly below 0.6 V. The g,, depression at higher Vgg is also 
clearly visible. 


T-Gate 


SiN spacer 


AuGe-Au Ohmic contacts 


Ino.53 Gag 47 AS cap 


Alo.ag !Mo.52 As, 6-dotiert 


Za Ino.53 Gag 47 AS quantum well 
i um 


Alo.ag !No.52 As, 6-doped 


Linearly graded buffer 
Alo.ag Gag.52 AS > Alo.ag !No.52 As 


Semi-insulating 
GaAs substrate 


Layer structure of the metamorphic HEMT structure discussed here. 
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Refractory metal 
T-gate structure 
SiN passivation 
Self-aligned 
ohmic contacts 
Lg=253 nm 
Fig. 2.42 SEM micrograph of the gate structure (F. Benkhelifa, M. Chertouk, M. Dammann, M. Massler, 
H. Walther and G. Weimann, Jnternational Conference on Semiconductor Manufacturing 
Technology GaAs MANTECH 2001 Digest, May 2001). 
800 
700 
£ 
E 600 
7p) 
£ 500 
3 
+ 400 
£ 
£ 
=z 300 
£ 
6 200 
100 
0 alll 
dl 0.8 -0.6 -0.4 -0.2 0 0.2 04 06 08 1 
Ves/V 
Fig. 2.43 Drain current (/p) and transconductance (gm) of the metamorphic HEMT, normalised to 


1 mm gate width (F. Benkhelifa, M. Chertouk, M. Dammann, M. Massler, H. Walther and 
G. Weimann, International Conference on Semiconductor Manufacturing Technology 
GaAs MANTECH 2001 Digest, May 2001). 


Figure 2.44, finally, shows the short-circuit current gain 42, as well as the maximum 
available gain (MAG) and the maximum stable gain (MSG) as a function of frequency, 
on a logarithmic scale. The current gain rolls off with an expected —20 dB/decade, and 
the transit frequency fr, measured at |42;| = OdB, is 110 GHz. The extraction of the 
claimed fmax of 300 GHz is less certain. As is explained in Chapter 5, fmax can be 
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|hp,/dB, MAG/dB, MSG/dB 


fiGHz 


Short-circuit current gain |421|, and power gains MAG and MSG, as a function of frequency, for 
the metamorphic HEMT structure (F. Benkhelifa, M. Chertouk, M. Dammann, M. Massler, H. 
Walther and G. Weimann, International Conference on Semiconductor Manufacturing 
Technology GaAs MANTECH 2001 Digest, May 2001). 


extracted at the frequency where MAG = OdB. The problem is that MAG only exists 
where Rollet’s stability factor k > 1, otherwise it is replaced by MSG. The change 
in slope of the MSG/MAG curve suggests that the transition between MSG and MAG 
happens only above 100 GHz, close to the upper end of the measurement range. From 
there, fmax Seems to be extrapolated also at —20 dB/decade, even though the true roll- 
off is much steeper. The problem in determining fmax from MAG can be circumvented 
if an extraction from Mason’s unilateral gain u is used. This is also explained elsewhere, 
and leads to different values for fmax. 

The important finding, however, is that using a metamorphic HEMT with an optically 
defined gate of Lg = 0.25 wm provides sufficient gain for applications at 100 GHz. 


Radio Frequency MOSFETs 


Introduction 


The silicon MOSFET is by a huge margin the most popular transistor structure. Long 
confined to either digital circuits or lower-frequency analogue applications, it now 
makes significant inroads into the realm of micro- and millimetre-wave circuits. With 
gate lengths below 100 nm, its cutoff frequencies fp and fmax now rival those of the 
already introduced HEMTs or advanced HBTs, which will be introduced in the next 
section of this chapter. 

We will briefly review the fundamental aspects of MOSFET operation and then pro- 
ceed to the analogue aspects of RF CMOS operation which are commonly not covered 
in texts dealing primarily with MOSFETs as components in digital VLSI and ULSI. 
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The unparalleled success of silicon as the material of choice in fabricating electronic 
components is due to several factors: 


e Silicon is cheap and has an almost limitless supply. 

e Silicon is mechanically robust. 

e Silicon has a high thermal conductivity, at least compared to GaAs and InP. 

e But most importantly, silicon has a highly stable native oxide, SiO2, which forms a 
high-quality interface with silicon. 


This latter property led to the unequalled victory of metal-oxide—semiconductor (MOS) 
technology. 


Basic MOSFET structure 
Consider a somewhat schematic cross-section of a MOSFET (Figure 2.45). 

It is not intended to do justice to the complexity of modern MOSFET devices, but 
shows their fundamental components. This is an n-channel device — the current in the 
channel will be carried by electrons. We will focus on n-channel devices here as this 
facilitates comparison with the previously discussed MESFETs and HEMTs (which are 
almost exclusively n-channel devices), but all findings relate analogously to p-channel 
devices as well, with appropriate modifications reflecting the differences in doping and 
free carrier type. 

First, we note that the electron channel will actually form in a p-type semiconductor 
region, called the bulk. This can be either the substrate or a p-doped layer formed by 
epitaxy or diffusion. This will be explained in the next paragraph. Secondly, the stable 
SiOz is used in two different ways: 


(i) as a thin gate oxide which covers the surface between the source (S) and drain (D) 
contacts and carries the gate (G) electrode on top; and 
(ii) as a thick field oxide which covers the remainder of the structure. 


The source and drain contacts are non-blocking (ohmic). The gate is physically sepa- 
rated from the semiconductor by the gate oxide — this arrangement is called an MOS 
(metal oxide-semiconductor) diode, even though the gate electrode in modern MOS- 
FETs is actually not metallic, but fabricated from highly doped polycrystalline Silicon 


(poly-Si). 


Field oxide (SiO) Gate oxide (SiO,) 


p-type bulk (substrate or well) 


Basic structure of an n-channel MOSFET. 


More free ebooks : http://fast-file.blogspot.com 


Electronic devices 97 


MOS diode operation 

Let us now investigate how an electron channel can form in the p-type semiconduc- 
tor. To this end, we look more closely at the band diagram in a region below the gate 
electrode. Initially, no external voltages shall be applied to the device. 

We construct the band diagram of the MOS diode (Figure 2.46), using Anderson’s 
rule. 

In the n*-doped poly-Si gate, we assume that the conduction band energy Eg coin- 
cides with the Fermi energy Ep. The bulk Si is p-doped, and we calculate the distance 
between the Fermi level and the valence band energy Ey, assuming that the Boltzmann 
approximation to the Fermi—Dirac statistics is valid: 


Ep— Ey =kT ln (*) ; (2.65) 
Na 
where Ny is the density of states in the valence band and Na is the acceptor 
concentration in the p-Si. 
The SiO? is handled as a semiconductor with a very large band gap. 
The distance between the conduction bands and the vacuum level, Eyac is given by 
the electron affinities in the Si and SiO2 — xs; and xsio,, respectively. Note xsi > Xsio- 


poly-Si | SiO, | p-type Si 
Esc 


Ec=Er 


Fig. 2.46 Band diagram of an MOS diode structure. 
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The continuity of the vacuum level (postulated by Anderson’s rule), along with the fact 
that the poly-Si is n-type, makes the bands in the p-type Si ‘dip’ towards the SiO2/Si 
interface. 

We introduced two new potentials and their corresponding energies: 


(i) q Vp; is the energy difference between the Fermi levels for the intrinsic and doped 


semiconductor: 
E E 1 N N E N 
eS eh See A ear OO) 
2 2 Ne Na 2 Na 


(ii) gq Vs is the energy difference between the undisturbed semiconductor and the 
S1/Si0> interface. 


Using these two potentials, we can easily distinguish four different regions: 


(i) Ws < 0: The bands ‘bend upwards’ — accumulation of holes at the interface — 
creation of a positive space charge of mobile carriers there. 

(ii) O < Wy < Vp: Depletion of holes at the interface — creation of a negative space 
charge of fixed carriers. The charges are the ionised acceptor atoms. Increase of 
Ws results in an extension of the space charge layer into the semiconductor. In the 
limit Y, = Vp;, the interface behaves like an intrinsic semiconductor. 

(iil) Vpi < Ws < 2 Vp: As the Fermi level is now closer to the conduction band than 
to the valence band at the interface, the conduction type converts from p-type to 
n-type. This condition is called light inversion. 

The density of minority electrons at the interface increases exponentially 
with W,: 
np = ny et Ue VeD/ ET 


(iv) At Wy > 2Vp; the interface carrier density rises sharply for small changes in 


W,, which remains almost constant for large changes in the interface charge. 
This condition is called strong inversion. The width of the depletion region stays 


approximately constant at 
Za 
Wmax = 2,/——— Via, (2.67) 
qNa 


for a homogeneously doped semiconductor. 


In Figure 2.46, 0 < Wy, < Vgi, hence the device is in depletion without exter- 
nally applied voltages and no channel forms. This is typical of n-channel MOSFET 
transistors — they have positive threshold voltages, in contrast to MESFETs and HEMTs. 

As the gate electrode is isolated from the semiconductor by the gate oxide, we can 
apply large positive gate-channel voltages without creating a gate current, as would be 
the case in Schottky diodes. This situation is shown in Figure 2.47. 

Note that the conduction band forms a triangular potential well at the Si/SiOz inter- 
face. In this respect, the MOSFET is closely related to the HEMT and will equally form 
a two-dimensional electron gas in the channel. 
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vac 


q Vox 


Fig. 2.47 MOS diode band diagram with positive Vg. 


The externally applied gate voltage drops partially across the gate oxide, partially 
across the semiconductor and increases W,. In the situation drawn in Figure 2.47, the 
hypothetical intrinsic Fermi level already drops below the Fermi level in the undisturbed 
p-type semiconductor (WY, > Vp), but YW; > 2Vp; has not been reached — the structure 
is in light inversion. An even more positive Vg will introduce strong inversion. 

If Vox is the voltage drop across the oxide, 


1 Nv 
Vg = Vox + Us + — | kT In — —- Eg}, (2.68) 
q Na 


The last term in Equation (2.68) is the flat-band voltage Veg: 


1 Ny 
Veg =: — | kT In — — Eg}. (2.69) 
q Na 


It is called flat-band because for Vg = Vrp, Ys + Vox = 0 and the bands become 
completely horizontal. 
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Vox can be calculated from the gate capacitance Cox and the total charge stored under 
the gate Qs: 
_ Qs 

Cox 
The total gate charge is formed by the space charge in the depleted region Qpx and the 
interface charge Qj. 

Let us now consider the case where Y, = 2 Vp; just occurs (onset of strong inversion). 
Q; is negligible, and hence Q, © Qp with 


Vox = 


Op = —¢ NA Umax Lo Wa, 


where Wmax is the maximum extension of the space charge region, equal to the extension 
for UV, = 2 Va (see Equation (2.67)) and Wg Lg is the gate footprint, which determines 
the area of the channel. Hence, 


Op = —2 /esiq Na Vi. 


We can now calculate the threshold voltage Vj, as the necessary VG to reach the onset 
of strong inversion. Recalling that at this point ¥, = 2 Vp;, Equation (2.68) yields 


My =-2 


+ 2 Vai + Ves 


Ox 
2Wo Le 
=e vesi9 Na Vp + 2 Vai + Ves. (2.70) 
Ox 


In strong inversion, the oxide capacitance is easy to calculate, because the mobile charge 
Q; is concentrated as a sheet charge at the Si/SiO2 interface (compare the situation 
in the HEMT). The sheet charge forms a simple parallel-plate capacitor with the gate 
electrode: 


ee 
Cox = Wa Lo, (2.71) 
Ox 
where fox is the gate oxide thickness. 
The threshold voltage is then approximately:° 
2 tox 
Vin = Vesig Na Vai + 2 Vai + Ves. (2.72) 


€SiO> 


Drain current 


So far, we considered only the voltage between gate and channel, assuming that the 
drain and source electrodes are on equal potentials. Now, we apply external voltages 
Ves, Vos 4 0, as in Figure 2.48. As before in the discussion of MESFET and HEMT, 
we make certain assumptions for the channel: 


e The channel shall be one-dimensional, i.e. the electric field has only a z component. 


5 Because Vo = Vin, the MOS diode is not strictly in strong inversion. 
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MOSFET structure with externally applied voltages. 


e The channel shall be gradual, i.e. the current is driven purely by the electric field and 
diffusion is neglected. 


Now that Vps > 0, the voltage between the gate electrode and the semiconductor will 
depend on the z coordinate along the interface: 


Vo(z) = Ves — V(z). (2.73) 


Constant-mobility model 

As discussed already, the channel current can be calculated from the local mobile charge 
and the velocity with which it moves. In case of the MOSFET, the local charge gq; is the 
mobile interface charge which in strong inversion can be calculated simply from the 
oxide capacitance and the local gate-channel voltage VG(z): 


ESiO> 


gi(z) = [Va(z) — Vin]. (2.74) 


We initially calculate the charge velocity for the low-field case, assuming that un, = 
const and find 
€SiO> , aV(Z) 


[Vo(<) — Vin] Hn 


vf = W 
D(z) G a a 


’ 


where jz/, is the interface mobility, which is lower than the bulk mobility due to the 
imperfections of the interface plane. 
Applying current continuity, we know that 


L, pe="S 
Ip(z) = const = Ip = | Ip(z)dz. 
Le z=0 


Using parameter substitution and noting that V(z = 0) = 0, V(z = Loa) = Vps, we 
find 
€si0. Wa, 


v2 
— Vea = Va) Vong = |; 2.75 
pe ie Ly [ GS th) Vps 5 (2.75) 


Ip(Ves, Vos) = 


The above equation only holds as long as the channel is not fully depleted. Because 
V(z) increases monotonically with z along the channel, depletion of the channel will 
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start at the drain, when Vps reaches the knee voltage V,, in full analogy to the MESFET 


and HEMT. 
€SiO2 


gi(z = Lg) = (Vos — Vk — Vin) = 0 


yields 
Vi = Ves — Vin (2.76) 


For Vps > Vx, the channel charge no longer depends on Vpg in this simple model. With 
Vps = Vx and using Equation (2.76), we find from Equation (2.75) 


€sio, Wo 
m2 = ul (Vas — Vin)’, OT 


Ip(Ves) = 
xX 2LG 


for Vps > Vx. 
This simple model of the MOSFET static behaviour is often referred to as the 
Shockley model [57]. 


Backgating 
Another parasitic effect influencing the static performance needs to be considered. 
Because the MOSFET sits on a conducting silicon layer (the ‘bulk’, p-type for 
n-channel, n-type for p-channel transistors), the device is essentially a four-terminal 
device, where the bulk is the fourth terminal. We had implicitly assumed that the bulk 
layer would have a fixed potential, which is that of the source contact. In an integrated 
circuit, however, this cannot always be maintained. Therefore, we need to consider a 
second control voltage, the bulk-source voltage Vgs. 

Recall that we defined the threshold voltage via the potential difference between the 
bulk and the Si/SiO2 interface. This suggests a very simple way of accommodating 
backgating — by modifying the threshold voltage: 


Vin = Vio — v Ves. (2.78) 


Here, Vio is the threshold voltage without backgating (i.e. the one considered so far) and 
y is a fitting parameter [49]. 


Non-ideal effects in short-channel MOSFETs 
Channel length modulation. 
So far, we neglected another effect: the source and drain regions form n—p junctions 
with the bulk semiconductor. These n—p junctions necessarily create depletion regions, 
whose width depends on the voltage across the junction. In an n-channel MOSFET, the 
drain has a positive potential with respect to source. Assuming that the bulk is held on 
source potential (Vgs = 0), the drain-bulk region is therefore reverse-biased, which 
leads to an increase in the width of the space charge region there. 

Figure 2.49 shows this situation. The effective gate length Le is shorter than the 
‘drawn’ gate length Lg. The difference is Vpg-dependent due to the drain space charge 
region: 


Letp = Lg — AL(Vps). (2.79) 
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Schematic representation of channel length modulation due to space charge incursion into the 
channel. 


AL can be approximated as follows: 


1 2€ 
AL =-— Vos — V. ’ 2.80 
ae Na‘ DS off) a (2.80) 


where a is a factor which depends on the exact geometry of the MOSFET - 
a = 0.02...1. This formula only applies for Vps > Vogr, only then will a part of the 
channel close to drain be fully depleted. 

Recalling Equation (2.77), it is easy to see that the progressive reduction of the effec- 
tive channel length with increasing Vps will cause the drain current to increase with 
increasing drain-source voltage. This effect is most pronounced if the ‘geometrical’ 
channel length Lg is already small — hence this is a very important effect in high-speed 
MOSFETs with channel lengths Lg < 0.5m. This effect is called channel length 
modulation and is conceptually very similar to the Early effect which we will introduce 
for the bipolar transistor (see p. 122). 


Short-channel effect. 

The calculation of the threshold voltage (Equation (2.72)), assumed that the gate and 
the mobile sheet charge in the channel form an ideal parallel-plate capacitor: the 
total interface charge Q; appears, with opposite sign, at the gate charge Qg. In real- 
ity, some of the field lines emanating from the negative charge in the channel may 
also terminate on the source and drain areas. Due to the n—p-junctions, the deple- 
tion of the channel region progresses more rapidly than predicted from considering 
the gate potential alone, which leads to a reduction in threshold voltage Vin. This is 
the short-channel effect proper, which says that for otherwise unchanged technological 
parameters, the threshold voltage will decrease with decreasing gate length. The effect 
is more pronounced, the deeper the source and drain contact regions extend into the bulk 
material. 
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Looly = 60 nm 
1054 Wg = 200 nm 


Vpg=0.02V 


Drain current per unit gate width 
A/um 
3 
~“N 


Ves/V 


Example for the Vpg dependence of subthreshold currents in deep-submicron MOSFETs 
(T. Sugli, K. Watanabe and S. Sugatani, Fujitsu Science and Technology Journal, Vol. 39, 
No. 6, pp. 9-22, June 2003.). 


LDD extensions 


p* pockets 


MOSFET structure with a double implant LDD arrangement. 


As the shape, specifically of the drain side space charge region, depends on the poten- 
tial of the drain contact, it is not surprising that Vps also has an effect on Vin: as the 
drain-source voltage is increased, the drain field will deplete the channel more, which 
leads to a further decrease of the threshold voltage. 

The effect of the drain field can best be shown in the subthreshold regime (see Figure 
2.50). For Ves < Vin, the channel current does not actually cease to flow, because even 
before the onset of strong inversion, there are free charge carriers in the channel. Their 
density and hence the current depend exponentially on Ves — Vin. The figure shows an 
example of the subthreshold regime for a deep-submicron MOSFET, for two different 
values of Vps. We note that even a small increase in Vps increases Jor = Ip(Vgs = 0) 
but two orders of magnitude. 

A very common modification of the standard MOSFET structure which reduces 
the short-channel effect and channel length modulation, and also improves the break- 
down voltage, is the double implant lightly doped drain (LDD) structure [41] shown in 
Figure 2.51. 

The shallow n-doped drain extensions lower the maximum electric field in the chan- 
nel and hence increase the breakdown voltage, while the pt-doped pockets slow the 
growth of the p—n space charge regions into the channel with increasing Vps. 
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Mobility degradation. 

Short-channel devices benefit from an increase in the bulk doping concentration, 
because the problem with the high /of¢ and also the channel length modulation can 
be decreased by increasing the bulk doping. Together with the thin gate oxide, however, 
this significantly increases the electric field component in y direction. The charge carri- 
ers in the channel will then flow, on average, closer to the Si/SiO2 interface where their 
mobility is reduced by interface scattering, due to imperfections of the interfacial layer. 
Therefore, the effective mobility will decrease with increasing Vcs, approximated by 


n,0 
1+m (Ves — Vin)’ 


(Vas) = (2.81) 
where je: g is the mobility at threshold and m is a factor describing the degree of normal- 
field mobility degradation. 

In summary, in short-gatelength MOSFETs, the simple one-dimensional approach 
we started out with is no longer adequate, hence two-dimensional effects have to be 
incorporated into the physical simulation. 


Velocity saturation 
As in the typically GaAs-based MESFET and HEMT devices, velocity saturation at 
high electric fields also has to be considered here. The critical field €,,, for velocity 
saturation in silicon is ~4 - 10° Vm7! at room temperature, compared to 3 - 10° Vm7! 
for GaAs, so the onset of velocity saturation is delayed in Si versus GaAs. 

If we assume that the channel is fully velocity saturated, i.e. u; = Usat # f(z), the 
drain current becomes 


€SiO. 
Ip = Wo 2 


Usat Vos — Vin); (2.82) 


OX 


where Usat is the drift saturation velocity of silicon, which is approximately 10° m/s at 
room temperature. 

In the intermediate region, Lee [34] gives the following approximation for the drain 
current: 
ESiO> Usat 


Lo Esat : 
oe ist Vas—Vin 


Ip = We (2.83) 


for Vos > Vp,sat, Where Vp, sat is the necessary field for velocity saturation to occur in 
the channel, approximated as 


(Ves — Vin) Le Esat 
(Vas — Vin) + Le Esat 


Consider a modern MOSFET with Lg = 0.09um. Le Eat is 0.36 V, and assuming 
Vos — Vin = 0.5 V, we arrive at Vp sat = 0.21 V — considerably smaller than Vy = 
Vas — Vin = 0.5 V, as the Shockley model would predict. Velocity saturation is hence a 
phenomenon with significant importance in deep-submicron MOSFETs. 


Vp, sat © (2.84) 
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Maximum drain current (‘on-current’) Jon, gate oxide thickness Tox and supply voltage Vgq as a 
function of the technology node (T. Sugli, K. Watanabe and S. Sugatani, Fujitsu Science and 
Technology Journal, Vol. 39, No. 6, pp. 9-22, June 2003.). 


Equation (2.82) can also be used to estimate the maximum drain current in a given 
MOSFET family. In devices with Lg < 130nm, fx * 2nm typically. For a gate 
overtravel Vos — Vin of 1 V and Veat = 10° ms—!, we find Ip/Wco = 1.72mA pm—!. 

In order to increase the current for a given gate over travel, we have only two options: 


(i) Decrease the thickness fx of the gate oxide. However, as fox is reduced, the elec- 
tric field in the dielectric increases, and the gate-channel tunnel current increases 
dramatically. 

(ii) Increase the dielectric constant of the gate dielectric. This can be done by replacing 
the SiOz with a different dielectric, such as HfO2, which features eyfo, = 25 € 
instead of €sio2 = 3.9 €9, but with limited thermal stability. Additionally, a major 
advantage of Si, namely its highly stable native oxide, is given up. 


Figure 2.52 presents a literature data review [61] of achieved maximum drain current, 
gate oxide thickness and power supply voltage as a function of the target technology, as 
of the year 2003. The decrease in the maximum drain current (‘on current’, Jo,) and the 
supply voltage Vag give proof to the problems CMOS designers face due to the reducing 
gate oxide thickness. 


Large-signal modelling 


A MOSFET’s non-linear circuit (Figure 2.53), is very similar to what we discussed for 
the MESFET, or HEMT, except that the substrate (‘bulk’) node needs to be accounted 
for. The diodes Dgs and Dgp represent the source and drain p—n diodes, respectively, 
and include junction capacitance. The series resistances Rg, Rs and Rp are taken as 
bias-independent. 

As discussed in the previous paragraph, the drain current /p will be a function of Ves 
and Vps. In a more precise model, we need to make the threshold voltage a function of 
Vps and via backgating also of Vgs, so the bulk-source voltage needs to be included as 
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Non-linear equivalent circuit suitable for MOSFETs. 


a controlling voltage as well. Please note that the voltages occur between the internal 
nodes — voltage drops across the series resistances will have to be subtracted. 

A particular feature of MOS transistors are the overlap capacitances. Referring, for 
example, to Figure 2.45, note that the gate electrode overlaps the highly doped source 
and drain regions. Without this overlap, at least on the source side, the channel could 
not form, as the free charge in the channel is drawn from the source region — unlike in 
the HEMT, where the free carriers are being introduced through the supply layer on top 
of the channel. These capacitances will be bias-independent, so that the gate-source and 
drain-source capacitances can be written as 


5QB 
ereen — 
GS GSO + 5Ves 
r) 
Cop = Cepo + Os ; (2.85) 
5VcGp 


where Ccgso and Cgpo are the gate-source and gate-drain overlap capacitances and 
Qs is the total space charge, which includes the fixed charge (¢ Na w) and the mobile 
interface charge Qj. 

In strong inversion, the change in channel charge is reflected only in the interface 
charge: 


z=Lg 


5Qp © 6Qi = We 7 qi(z)dz. 


z=0 
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Equation (2.74) indicates that in strong inversion, the total interface charge will only 
depend on Vgs; therefore, Cgp will be given only by the overlap capacitance. Non- 
ideal effects can be included into the capacitance equation by making the threshold 
voltage Vin Vps- and Vgs-dependent. 

The Meyer capacitance model [38] already introduced for the MESFET is frequently 
used to model the bias dependence of Cgs and Cop. In strong inversion, 


e For Vps < Vk, 


2 Vi — Vos \* 
Ces =C Coc | 1 
GS Gso + 3CcGc (st = 4) 


2 
Cove excl Vi (2.86) 
GD = ©~GDO 3 GC Wy = Vos . 


e For Vps > Vk, 


2 
Cos = Ceso + 30ac 
Cop = Cepo. (2.87) 


where V, is the drain-source voltage delineating the linear from the saturated regime 
and (see Equation (2.76)) Cgc is the gate-channel capacitance for Vps = 0. In strong 
inversion, it is simply the total oxide capacitance (see Equation (2.71)). 

The gate-bulk capacitance can be similarly expressed; it models the effect of the bulk 
potential on the channel charge: 


In strong inversion, it can be neglected. 

The model may be extended with additional elements. Particularly, in RF designs 
the modelling of the impedance connected to the substrate node deserves particular 
attention. 

At the core of most submicron RF CMOS models is the BSIM3 model, devel- 
oped at University of California, Berkeley.° Unlike e.g. the COBRA model introduced 
for HEMTs, it uses different sets of equations for the MOSFET’s different operating 
regions. It is also more closely related to device physics, i.e. it is not strictly an empiri- 
cal model, and its input parameters are partly technological and partly empirical fitting 
parameters. BSIM’s complexity, however, is beyond the scope of a book like this. 

A simpler model approach, yet useful for many applications, was published by 
Sakurai and Newton [49]. The threshold voltage is 


Vin = Vo + v (\/208 — Vos - V26r), (2.88) 


where Vio, y and ®p are model parameters and Vgs is the bulk-source voltage — the 
above equation therefore includes backgating. 


© Web resource at www-device.eecs.berkeley.edu/bsim3/ 
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The drain-source saturation voltage, Vx, is modelled as 
Vi = K (Vos — Vin)”, (2.89) 


where K and m are model parameters. This formulation is more flexible compared to 
Equation (2.76) and allows to include velocity saturation effects. 
The drain current at Vps = Vx is 


Wo 
Ip,sat = ie B (VGs — Vin)". (2.90) 
€ 


where B and n are model parameters. With variation of n, we can address both the 
constant-mobility model (n = 2) and the constant velocity model (n = 1); however, as 
n is not Vps-dependent, we cannot move from one regime to the other. 
A Vgs-dependent n, on the other hand, would allow to include the deterioration of 
mobility at high electric fields normal to the Si/SiO> interface (see Equation (2.81)). 
The drain current formulation distinguishes between the saturated and non-saturated 
regions: 


e For Vps > Vx, 


Ip = Ip,sat 1 + A Vps), (2.91) 
where 4 = Ag — Aq Vas. 
e For Vps < Vk, 
Vi Vi 
Ip = Ip sat (1 +4 Vps) (2- 22) 2S. (2.92) 
Ve Vu 


Small-signal model and RF performance 


The small-signal equivalent circuit of the MOSFET is very similar to that used for 
MESFET or HEMT, but must account for the additional substrate node. 

Figure 2.54 shows that a fourth terminal (B) has been added, which is capaci- 
tively coupled to the internal gate, source and drain nodes. Cgg is shown to facilitate 
comparison with Figure 2.53; in saturation it is neglected. 


Re R 
C, D 
GOAAA | Cap = AKA —O 
wy) 
oN D 
Vgs | ——— ve ie 9ds —_——_ —— 
C, O mb “bs CG C 
GS BS | BG BD ‘ 
B 
| 
Rg Vos 
cS) 


MOSFET small-signal equivalent circuit. 
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Backgating is included by a special backgating transconductance: 


The transconductance in the constant-mobility limit and for Vps > Vx is 


_ dip _ €si0, Wa ky 
dVcs tox Lg 


&m (Vos — Vin), (2.93) 


using Equation (2.77). 
In the constant-velocity limit, 
€SiO 
&m = —— Wo Vsat. (2.94) 
fox 
The output conductance is 
bIp 


Bi = Ys" 


In the linear region (Vps < Vx), using the constant-mobility drain current Equa- 
tion (2.75), we find 


€sio, We Lh 
Le tox 


8ds = [(Ves — Vin) — Vos] - (2.95) 


In the saturated region (Vps > Vk), we use the Sakurai-Newton model Equation (2.91) 
as our simplified physical regions would predict gas = 0 there: 


BWo 
Sds = ALD, sat = A—— (Ves — Vin)”. (2.96) 
Lett 


To reconcile the Sakurai-Newton model with the constant-mobility model, choose B = 
(Eid, Mh) /tox, n = 2, Lett = Le. 


Transit frequency. 
We again approximate the transit frequency with 
&m 
TS. = 
Cas + Cap 


In saturation and using the Meyer capacitance equations (2.87), 


T= - 
Ceso + Capo + $Cox 


While recognising the importance of the overlap capacitances, let us assume for 
simplification that Cox dominates. 

For the transconductance, we need to distinguish between the constant-mobility and 
constant-velocity models. For the constant mobility, using Equation (2.93), we obtain 
=, ait 
~ 2 

4a Le 


St 


(Ves — Vin)- (2.97) 
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The transit frequency is predicted to increase linearly with the gate overtravel. However, 
this does not take the mobility degradation with increasing normal field into account (see 
Equation (2.81)). 

In the constant-velocity limit, the transconductance is given by Equation (2.94) and 
the transit frequency becomes 


3Usat 
= A 2.98 
fr in ie (2.98) 


We already recognised (page 105) that velocity saturation will dominate in short- 
channel MOSFETs, so we conclude that, like in MESFETs and HEMTs, fr will scale 
inversely proportional to the gate length for short Lg. 

For a given Lg, the transit frequency may still be increased by improving the mobil- 
ity, because velocity saturation will be reached sooner and the average velocity in the 
channel increases. 

Both electron and hole mobilities in silicon are enhanced if the silicon layer expe- 
riences a tensile strain in the plane parallel to the Si/SiO2 interface. Semiconductor 
heterostructures can be used to achieve this: on top of the silicon wafer, first a strain- 
relaxed Si;_,Ge, buffer is grown. As was discussed in Section 1.20, the addition of Ge 
lowers the band gap and at the same time increases the lattice constant. The latter effect 
is used here — if a thin Si layer is grown on top of the SiGe buffer, it experiences a tensile 
strain. 

Figure 2.55 shows an example of this strained-layer technique, here combined with 
silicon-on-insulator [2]. The use of silicon as the channel layer makes this structure fully 
compatible with existing gate technology modules. 

The axial tensile strain has a significant impact on the electron mobility, as is shown 
in Figure 2.56, at the expense of extra processing steps, and potential yield limitations 
when using non-lattice-matched materials. 

For p-channel MOSFETs, it is advantageous to place the channel into SiGe layers 
with significant Ge mole fraction. This will not be discussed here further, however. 


Strained Si 


Siq1-x) Ge, 


Strained Si 


Buried SiO, 


Si wafer 


Heterostructure-on-insulator layer stack on a strained-Si MOSFET (left), and corresponding 
TEM micrograph (TEM micrograph from D. A. Antoniadis, I. Aberg, C. NiCléirigh, 

O. M. Nayfeh, A. Khakifirooz and J. L. Hoyt, JBM Journal of Research and Development, 
Vol. 50, No. 4/5, pp. 363-377, April-May 2006. ©IBM). 
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Electron mobility enhancement in strained-Si layer (Data adapted from D. A. Antoniadis, 
I. Aberg, C. NiCléirigh, O. M. Nayfeh, A. Khakifirooz, J. L. Hoyt, JBM Journal of Research 
and Development, Vol. 50, No. 4/5, pp. 363-377, April-May 2006. ©IBM). 


Maximum frequency of oscillation. 

As already noted, the maximum frequency of oscillation fmax is the more meaningful 
figure of merit in analogue high-speed applications. As the equivalent circuit for the 
MOSFET is very similar to that used for MESFET and HEMT, we can easily adapt the 
max equation used there: 


St 
2./gas(RG + Rs) + 22 fr Ro Capo 


because in saturation Cgp = Capo. 


fi max = 


(2.99) 


The combination of strained silicon channels and ultrashort gate lengths enables cut- 
off frequencies above 300 GHz for n-channel CMOS. In a 65 nm technology, a device 
with Lg = 29 nm was reported to have an fr = 360 GHz and an fmax = 420 GHz [45]. 


Microwave noise. 
The treatment of noise in MOSFETs traditionally neglects the gate and source series 
resistances and considers only two noise sources [34]: 


e The spectral noise current density generated in the channel: 


(iia?) =8kT y gu, (2.100) 


where ggo = OIp/Vps at Vps = O and y is a parameter which varies from a 
value of 1 at Vps = 0 to 2/3 at Vps = Vx. This model is valid only in the linear 
region (Vps < Vx), and was developed for MOSFETs with long gate lengths. In 
short-channel FETs and in saturation, the observed spectral noise density can be sub- 
stantially higher. This can be accommodated by making the temperature T larger than 
the lattice temperature, to account for the significant kinetic energy of the free charge 
carriers. 


More free ebooks : http://fast-file.blogspot.com 


Electronic devices 113 


e The induced spectral noise current density of the gate current: 


(lig?) = BkT 5 gp, (2.101) 
where 
w Che 
B= Fay 
&d0 


In long channel FETs, 5 = 4/3. 

As discussed for MESFET and HEMT, these two noise sources are partially corre- 
lated; in the long gatelength limit, the correlation coefficient is c = j 0.395. From these 
noise sources, the minimum noise figure can be calculated to be 


1 — |c|?). (2.102) 


Because fr in velocity saturation scales ~1/LG, we expect the noise figure to vary 
linearly with the gate length. 

An additional noise contribution can come from the substrate. The conducting sub- 
strate can be lumped together into a single value, the so-called spreading resistance, 
Rsypb. This resistance naturally creates thermal noise, with a spectral noise current 
density: 

(Kiwi?) = = (2.103) 
Rsub 
This noise current can be capacitively coupled into the transistor via the bulk node. It 
also leads to a voltage drop across Cgs, which will create an additional drain current 
fluctuation via the backgating effect (see Figure 2.54). 

The issue of the source and gate series resistances needs to be re-examined. Modern 
MOS devices have poly-Si gates. Even highly n-doped poly-Si has specific resis- 
tivities which are much higher than for metal films. In RF CMOS technologies, 
the way around this problem is to connect many very short gate fingers in paral- 
lel, e.g. 40 gates of 5\1m gate width each, for a total Wg of 200m. As the gate 
length is more and more decreased, the gate resistance still needs to be recognised 
with, as Figure 2.57 [48] demonstrates. In this experiment, the noise figure is no 
longer decreased for LG < 0.5\1m, due to the increase in gate series resistance. 
Improved gate processes are therefore an important aspect in RF CMOS technology 
development. 

As the gate oxide thickness fox decreases, the gate current due to Fowler—-Nordheim 
tunnelling increases strongly. It generates a shot noise contribution [43], which will 
have to be accounted for in future MOSFET noise models. If Jc is the gate current, then 
the spectral noise current density generated is 


(Iie?) =4qIc. (2.104) 
This gate current leads to an additional term in the Finin expression [19], compare 


Equation (2.102): 
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Noise figure versus gate length for n-channel MOSFETs (40 fingers of WG = 5 wm each) 

(M. Saito, M. Ono, R. Fujimoto, H. Tanimoto, N. Ito, T. Yoshitomi, T. Ohguro, H. S. Momose 
and H. Iwai, IEEE Transactions on Electron Devices, Vol. ED-45, pp. 737-742, March 1998. 
©1998 IEEE). 


2qIGgaoy 


eater f2 CR (2.105) 


Fon = 14-£ | Pa - ce) + ————— 


For low frequencies, the second term under the root dominates and the minimum noise 
figure becomes independent of frequency: 


1 2qIcgaoy 


Fin © 1+ — 2.106 


The appearance of a frequency-independent component in Fmin(/) is a tell-tale sign of 
gate-related shot noise. 


A gate current due to Fowler—Nordheim tunnelling is expected to vary with the 
normal electric field across the gate oxide as 


3 

2 

Ig ~ EF sion exp (-2 , (2.107) 
y, SiOz 


where ¢ is the barrier at the interface. It will therefore be strong function of the gate 
overtravel Vgs — Vin. The occurrence of gate leakage has thus also important impli- 
cations on the design of low-noise amplifiers using sub-100nm CMOS technologies, 
namely in the choice of the bias point. 
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Bipolar and hetero-bipolar transistors 


Despite the dominance of MOSFETs in digital circuits, and the significance of HFETs in 
micro- and millimetre-wave ICs, bipolar transistors have made a strong comeback since 
the 1990s in high-speed analogue electronics, which is particularly due to the arrival of 
the Si/SiGe heterostructure bipolar transistor (HBT), but also due to widespread use of 
GaAs-based HBTs in power amplifiers, e.g. of mobile phone handsets. 

One important advantage of bipolar devices is that the current flow is vertical rather 
than lateral in FETs. This means that the critical geometric dimension (the base layer 
thickness) is defined by epitaxy or ion implantation. In FETs, the speed-limiting geom- 
etry is the gate length which, in present commercially available devices, is defined 
laterally by lithographic means, at a much higher cost. 

We will approach the understanding of HBTs by first considering the standard homo- 
junction homojunction bipolar transistor (BJT), which has long been a corner stone of 
high-speed electronics, but is gradually being replaced by Si/SiGe HBTs. In particular, 
we will get a grasp of the shortcomings of the homojunction BJT and how they can be 
solved by the introduction of bandgap engineering. The HBT is then a straightforward 
extension of the bipolar transistor concept. 


Homojunction bipolar transistors 


Homojunction bipolar transistors are the ‘classical’ bipolar transistors, where all parts 
of the device are fabricated from the same semiconductor material. Only silicon devices 
have any market relevance today; for the discussion of high-speed electronics, we can 
restrict our considerations to n—p—n type devices for reasons which will become clear 
shortly. 

With a suitable permutation in indices, the discussion of n—p—n homojunction bipolar 
transistors is also valid for p—n—p devices, however. 

Figure 2.58 shows the time-honoured’ one-dimensional representation of a con- 
ceptual bipolar transistor of the n—p—n type: the emitter layer is highly donor-doped 


S © 


<+— Ip <— 
Vee > 0 Vop > 0 
Emitter Base Collector 
fe n=Npe p=NapB n=Npc Io 


© VY 


Simplified schematic cross-section of a BJT. 


7 This schematic picture actually dates back to Figure 3 of Shockley’s US patent [56]. 
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(n-type), followed by an acceptor-doped (p-type) base layer of medium doping density 
and a typically lower doped donor-doped (n-type) collector layer. 

The emitter, base and collector contacts are non-blocking (ohmic) contacts and are 
assumed here to be of the recombination type. As shown, two external voltage sources 
Veg and Vcp are connected to the device in such a way that the 


e base-emitter p—n junction is forward-biased and 
e base—collector p—n junction is reverse-biased. 


This mode of operation is called active forward operation. 


Diffusion triangle 

To understand the way in which we control current in the bipolar transistor, let us 
concentrate initially on the base layer only. The first parameters to introduce are the 
diffusion length of minority charge carriers in the base; in this case the diffusion length 
of electrons (the base is p-type) Ly, and the thickness of the neutral base layer Wg. 
Wg is the thickness of the neutral base as we have to subtract the space charge regions 
first. We calculate the diffusion length from the low-field carrier mobility jz, the carrier 
lifetime t, and the absolute temperature T. In the n—p—n transistor, the minority charge 
in the base are electrons and hence we have to use the electron mobility jz» and the 


electron lifetime in the base t;,n: 
kT 
Ly = goo (2.108) 


where k is Boltzmann’s constant and g the elementary charge. 
The term 


kT 
Da = ie (2.109) 


is the diffusion constant for electrons, hence 


Ly = JV Daten. 


Equation (2.109) is the Einstein equation introduced earlier, in Equation (1.79). 

The emitter—base junction is forward-biased. Therefore, the minority (here, electron) 
concentration in the base immediately adjacent to the emitter—base space charge region 
(defined as y = 0) is elevated according to 


2 
n: 
np(0) = 2 eVBE/KT (2.110) 
N, 
A,B 

where 7;,p is the intrinsic carrier density in the base. 

Provided that the p-layer is infinitely extended in the y direction, the excess minority 
carrier density decays as 


2 


fib ny —y/L 
Np(y) = ia + | np(O) — Nea Coe (2.111) 
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Minority carrier concentration np in the base of an n—p-n bipolar transistor, as a function of 
coordinate y perpendicular to the surface. 


In the bipolar transistor, however, the base width Wg is made much shorter than the 
diffusion length:® 


Wz < Ln. 


It is instructive to estimate a numeric value for L,. Both zy and T,., are strong functions 
of doping. Let us assume that for a reasonable base doping concentration, T,,, = 1 ws 
and fy = 500 cm? (Vs)7!. Then the diffusion length amounts to Ly = 361m, which is 
certainly much larger than the base width in any microwave bipolar transistor. 

The reverse bias across the collector—base junction will cause the minority carrier 
at the collector side of the neutral base (y = Ws) to be significantly smaller than the 
minority carrier density in the undisturbed semiconductor: 

iy -aVep/kT 
np(Wp) = a, <* qVen/kT | (2.112) 
AB 

Provided that Wg < Ln, which is equivalent to neglecting recombination in the base, 
the distribution of minority carriers in the base as a function of the coordinate y (which 
is perpendicular to the surface of the device) is a linear function (see Figure 2.59). Due 
to its geometric shape, it is sometimes referred to as the diffusion triangle. 


Collector current equation 
In the classic bipolar transistor, the current is carried through the base layer by diffusion 
only, because the electric field in y direction in the neutral base can be neglected. 

We formulate the electron current density flowing through the base layer as a 
diffusion current: 


dn n,(0O) — ny (Wp) 
Jn.p = Dn 3 = qD,>—_? ~ 


Np 0 
y 


: (2.113) 
Wea Wp 


8 The so-called short-base diode condition. 
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because np(0) >> np(Wg). Dn is the diffusion constant of electrons in the base, which 
has already been defined in Equation (2.109). 

Note that due to dnp(y)/dy = const, the current is constant throughout the base. We 
obtain the collector current by inserting Equation (2.110) into (2.113) and multiplying 
with the emitter area AR: 


2 
n- 
Ic = qAEDy aie : ene. (2.114) 


still considering np(Wg) « np(0), i.e. under reverse collector-emitter bias. Without 
this condition, we obtain 


2: 
ne 
as gAEDa (eVau/"s = eVeui¥r) (2.115) 


The expression in the denominator of Equation (2.115) Wg Na_.p is the Gummel number 
of the base layer, Gg. In the above example and also below, we assume that the base 
doping concentration is constant across the neutral base. If this is not the case, i.e. 
Nap = f(y), we calculate the Gummel number as the integral sheet charge in the 


base: 
Wp 
GER = Na p(y)dy. (2.116) 
0 


Ideal base current 
We will now calculate the ideal base current of a bipolar transistor, i.e. the base current 
without components due to recombination. 

For this, we consider the emitter layer as a short-base diode also: WE < Lp, where 
Wg is the emitter width and Ly is the diffusion constant of minorities (here, holes) in 


the emitter: 
Lp = V Dptrp, (2.117) 


where 7,,p is the carrier lifetime of holes in the emitter and 


kT 
Dp = ie (2.118) 


is the diffusion constant for holes. jp is the hole mobility. 

Here, the minority carrier density is assumed to be zero at the emitter contact (ideal 
recombination contact). The result is a linear dependence of the minority carrier density 
Pn ON y in the emitter (see Figure 2.60). 

In analogy to Equation (2.113), we write the hole current in the emitter as a diffusion 
current: 


2 
n: 
Jpg = sy Ge (2.119) 


where nije is the intrinsic carrier density in the emitter. Multiplication with the emitter 
area Ag yields the base current: 


2 
ne 
eS q Debye (2.120) 
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Minority carrier concentration in a bipolar transistor, assuming short-base condition in emitter 
and base layer. 


Ideal current gain 
We can now calculate the ideal large-signal forward current gain of the bipolar 
transistor, dividing Equation (2.115) by (2.120). 


2 
Ic — DoWe Noe "ib 
Tg = Dp Wp Nas ne 


Br= (2.121) 
Let us consider the third term in Equation (2.121) first: in a homojunction transistor, 
where emitter and base are composed of the same material, we can assume Nj,p © Nie. 
This is not exactly true because the band gap and hence the intrinsic carrier density also 
depend weakly on the doping concentration, but it is a useful simplification. 

However, we also note that provided we can fabricate the emitter from a different 
material, it should have a larger band gap (and correspondingly a smaller intrinsic carrier 
density) such that n;,5 >> ni,e. This wide-gap emitter is the fundamental idea behind the 
HBT which we will treat in the next section. It was already included in Shockley’s 
original transistor patent [56] and theoretically expanded upon by Kroemer as early as 
1957 [31]. 

If we consider the second term, we recognise that we cannot arbitrarily increase the 
base doping concentration unless we also increase the emitter doping concentration, 
without hurting the current gain. This will lead us to the fundamental limitation of the 
homojunction bipolar transistor — the inability to lower the base resistance sufficiently 
for excellent microwave operation. 


Non-ideal current contributions 
In certain bias conditions, we will have to include additional currents in our considera- 
tions. For this, it is instructive to view Figure 2.61. 

Especially at low collector currents, recombination currents can frequently not be 
neglected: in bipolar transistors fabricated in direct bandgap semiconductors such as 
GaAs, the carrier lifetime may be so short that the short-base diode condition (Wg « Ly 
in case of an n—p-n transistor) is never quite fulfilled, so that volume recombination in 
the neutral base has to be accounted for. In other devices, surface recombination near 
the emitter—base p—n junction may play a significant role. These effects are all lumped 
together in a current contribution Jp, which in an n—p-n transistor exists as an electron 
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Schematic representation of electron (solid shaded) and hole (hatched) currents in a bipolar 
transistor. 


current (contributing to the emitter current) and a hole current (contributing to the base 
current). 

To account for the recombination current, we add an additional term to the base cur- 
rent Equation (2.120). It has the form of a diode current term with its typical exponential 
voltage dependence: 


I 
Ig = = + Ise(expt V5"/MRAT) — 1) (2.122) 
F 
The emission coefficient Np is larger than 1. NR = 2 is a typical value for many 


recombination processes and Jspr is the saturation current of the non-ideal base current 
term. 

For high Veg, electrons in the collector space charge region may gain sufficient 
kinetic energy to elevate a valence electron into the conduction band when colliding 
with a lattice atom — impact ionisation occurs. The charge carriers created by the impact 
ionisation may again gain sufficient kinetic energy in the strong electric field to cause 
impact ionisation themselves, leading to a strong increase in current. This is called 
avalanching and is an important breakdown mechanism in bipolar transistors. 

Avalanching is accounted for through an additional current term J,,, which exists as 
an electron current adding to the collector current, and a hole current, which subtracts 
from the base current. This is again shown in Figure 2.61. 


Non-ideal current gain 
The non-ideal current gain can now be written as 


Ineb — IR +L, 
R= neb R av 


= . (2.123) 
Tpbe + IR = Tay 


Even more insight is provided if we express Equation (2.123) in the form of a common 
base current gain. 
—Tlp _ B-1 


A= —— 
Ic B+1 
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Minority carrier distribution in the base layer in the presence of recombination. 


— Theb . Theb ion IR . (: rs Tay ) (2.124) 
Theb + Thbe Theb Theb — IR 

In Equation (2.124), the first product term is the emitter efficiency yg, which describes 

the electron current from the emitter to the base normalised to the overall current 

across the emitter—base junction. The second term is the base transport factor ay, 

which describes the ratio of electron currents across the base—collector and emitter—base 


junctions. The last term is the impact ionisation factor am. 
Equation (2.124) can hence be rewritten as 


A=yYe-aT- Om. 


Let us dwell on a7 for a moment. If recombination in the base cannot be neglected, then 
the minority carrier concentration in the base becomes 


2 | sinh ( tyat) 
Nn: Ln Ce - 1) + 


sinh (+) 


sinh( “8 ) 


iB 
Na | sinh(¥®) 


ap(y) = e AVeB/kT | (2.125) 


This is depicted in Figure 2.62. 
ay can be interpreted as the ratio of the minority carrier gradients at y = Wp and 
y=0: 


oe (y = Wa) 
= —: (2.126) 
Np _ 
FE (y = 0) 
In the presence of recombination, the base transport factor is therefore always less than 
one. 
Saturation 


In the above discussion, we assumed that the base—collector space charge region was 
reverse-biased — no carriers were injected into the base from the collector. We will 
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Base minority concentration under saturation conditions. 


now drop this condition and allow the base—collector junction to also become forward- 
biased. In the n—p-n transistor considered here, this means Vop < 0. The corresponding 
bias condition is called saturation. 

We can conveniently treat this condition again using the diffusion triangle concept 
introduced in Figure 2.59. Again, recombination across the base is neglected. 

Because charge carriers are being injected into the base across the forward-biased 
collector—base junction, the minority carrier density in the neutral base adjacent to the 
collector—base space charge region is elevated. For the n—p—n transistor, 


2 
n: 
ny(Wg) = ea Ven/kT (2.127) 
NAB 
The resulting diffusion triangle in the base is shown in Figure 2.63. 
The ideal collector current under saturation conditions can once again be calculated 
using a pure diffusion current ansatz in the base: 


2 
dnp(y) — Nib (c# Vaw/tT _ e-4 Ven/kT ) (2.128) 


Ic=qD 
Cc quan dy q "Na pWe 


For inhomogeneous base doping profiles, replace Wg Nap by the integral Gummel 
number Gx according to Equation (2.116). 

For a fixed base-emitter voltage, the collector current will now strongly decrease with 
decreasing collector—base voltage, while in the initial discussion of the active forward 
regime (see Equation 2.115), Jc did not depend on Vog. 

In high-speed circuits, the saturation regime has to be carefully avoided due to charge- 
storage effects whose detailed discussion is beyond the scope of this book. 


Early effect 

Upon closer examination, the collector current will show a dependence on Vcg even in 
the active forward regime. When deriving Equation (2.115) and the following equations, 
we had assumed that Wg was constant. 
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However, consider that Wg is defined as the width of the undepleted (neutral) region 
bordered by the emitter—base and the base—collector space charge regions. Because 
the collector current depends exponentially on the base-emitter voltage, Vgz ~ const 
across a wide range of collector currents, and the width of the base-emitter space charge 
region can be considered constant. 

As we change the voltage across the base—collector p—n junction, however, the width 
of its space charge region will be modulated, which leads to a variation in Wg. 

If for simplicity we assume homogeneous doping profiles in base and collector 
(Na.B = const, Np,c = const), the extension of the base—collector space charge region 
into the base layer is 


N. V \ 
ae 78 _ND,C | p+ VcB (2.129) 
q Nap Naspt+Np,c 


where €g is the dielectric constant of the base layer material and Vp the built-in voltage 
of the base—collector p—n junction. 

The diffusion triangle representation in Figure 2.64 may be helpful again. The dark- 
shaded areas denote the initial space charge regions. If Vcp is increased (with Vgg = 
const), the base—collector space charge region will expand as indicated by the light- 
shaded areas. Correspondingly, the minority carrier gradient in the neutral base will 
increase as the neutral base width shrinks. 

Hence, the collector current will increase with increasing Vcg. This is the Early effect 
[15]. 

More quantitatively, revisit Equation (2.115), as we consider only Vcgp > kT/q. 
However, now Wg = f(Vcp): 

2 


ne 
Ic = qApD, ——_»* (2.130) 
2 " We (Ves) Nap 


We differentiate Equation (2.130) with respect to Vcp: 
dIc Ic dW, 


= . ; 2.131 
dVcp Wp dVcp ( ) 


n 
Emitter 4 Base Collector 


y 


Modification of the base diffusion triangle due to the modulation of the base—collector space 
charge region. 
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Wp can be written as Wa — dyp, where We is the layer thickness of the base, i.e. without 
subtracting the space charge regions.” Hence for small changes in the neutral base width 
(Wh = Wp), 


dIc _ Ic dW, _ Ic déyg _ Ic ep Np cNap 1 
dVcp We dVcp WedVcp- NapWs\ 2q Np.c+ Nap Vo + Vos 
(2.132) 
With the Gummel number for homogeneous doping concentration in the base, 
Gp = Na.BWp 
and the base—collector capacitance per unit area 
,  _ [€Bq NaBNp,c 1 
Aen 2 Nap+Np,c Vo + Vos 
it follows that 
dIc Ic 
=—, 2.133 
dVcpn- Va : 
where Va is the Early voltage: 
-G 
Va = 8. (2.134) 
Cics 


The Early voltage is therefore directly proportional to the base Gummel number. 

In microwave electronics, the Early voltage is an important factor because it affects 
the linearity of power amplifiers. For highly linear power amplifiers, a high Early 
voltage is desired, as it reduces the dependence of the collector current on the collector— 
emitter voltage. Due to the exponential dependence of the collector current on the 
base-emitter voltage, Vgg is approximately constant even for large variations of Ic, 
so that d)Vck © dVcp. A high Early voltage Va hence reduces the dependence of the 
collector current on the collector—emitter voltage. 


Kirk effect 
The last intrinsic effect to be discussed here is the Kirk effect. It can be once again 
explained using the diffusion triangle (see Figure 2.65). 

Despite the fact that the minority carriers traversing the base are being injected into 
the base—collector space charge region, we had so far assumed that the space charge 
itself remains unaffected. This is true as long as the density of mobile charge is much 
smaller than the density of fixed charge. 

If we increase the collector current sufficiently to create a density of mobile charge 
comparable to the fixed space charge density, the mobile charge will start to compensate 
the space charge and the space charge region will shrink. 


° Well, actually we have to subtract the extension of the emitter—base space charge region into the base, but 
that can be neglected here. 
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Schematic representation of the Kirk effect influence on the diffusion triangle. 


If the width of the space charge region decreases, the neutral base zone will expand — 
this is called base push out. Correspondingly, the minority carrier gradient and with it 
the collector current for a given Vpg will decrease. 

To get an estimate of the collector current necessary to cause the Kirk effect, let 
us assume that the free charge (electrons in our case) will be accelerated to their drift 
saturation velocity Usat immediately after they enter the base—collector space charge 
region. Then the electron density nc corresponding to a collector current density Jc is 


Jc 
q- Usat 
If we now require that the fixed ionised donors in the collector of density Np,c shall be 


fully compensated by the mobile charge (7c = Np,c), we find for the critical current 
for the onset of the Kirk effect: 


nc = 


Jo.Kirk = 9 + NDC - Usat- (2.135) 


The onset of Kirk effect hence scales proportionally with the collector doping concen- 
tration. 


Small-signal dynamic behaviour 


Next, we will discuss the dynamic behaviour of the bipolar transistor for the small- 
signal case, where the non-linear relationships between the terminal currents and the 
voltages can be described as linear relationships between the deviations of these entities 
from a given bias point, i.e. 


ic = 51 c, ip = STB, Vbe = SVbes Uce = SVcE.- 


The forward-biased emitter—base junction will now be described by a conductance: 


dUc+Ip) — dig 
dVgpz sé‘ BE 


where ge will be referred to as the dynamic emitter conductance. The minus sign is 


= ge, (2.136) 


due to the convention that all currents Uc, Jp, Je) are counted positive flowing into the 
transistor. 
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From Equations (2.115), (2.120) and (2.121), we conclude that 


2 
n: 1 
Tg = — (Uc + Ip) = —GAgD, 2 eMBE/ Yt ( 1 + — 2.137 
E Uc + Ip) qAz "Gp° BF ( ) 


in the active forward regime, neglecting any non-ideal current contributions. 
Differentiating Equation (2.137) with respect to Vgg, we find the dynamic emitter 
conductance to be 


= (2.138) 
&e = Ve : 
It can be written as the dynamic emitter resistance: 
Vv 
ee (2.139) 
Tg 


The modulation of stored charge in the neutral base results in the diffusion capacitance. 
In the short-base diode limit, for a homogeneously doped base layer, and if we neglect 
carrier injection from the collector into the base (i.e. for Vcp sufficiently high), the 
stored minority charge in the base can be easily calculated, refer again to Figure 2.59: 


W n2 
Op = qAp— Van! Vr (2.140) 
2 Nas 


Differentiating Op with respect to Vgg results in the diffusion capacitance Cg: 


2 
, = FQp gags Tip | Vee / Vr 
dVBE 2 Nap Vr 


(2.141) 


Considering Equation (2.137) and recalling that in the homogeneously doped case, the 
base Gummel number is Gg = Na.p Ws, we find that Equation (2.141) can be written 
as 


(2.142) 


The ratio of diffusion capacitance to dynamic emitter conductance is bias-independent 
(at least in the active forward regime) and is called the base transit time Tg: 


_ Ca We 
~ ge 2D 
Recall that Dy = (kT /q)fn Equation (2.109); therefore, the base transit time is 
inversely proportional to the minority carrier mobility in the base. As the electron mobil- 
ity (4y 1s significantly larger than the hole mobility jp, in Si as well as the most common 
compound semiconductors, this justifies the restriction of our discussions to n—p—n-type 
transistors. 

The output conductance is related to the Early effect. We find it by differentiating Jc 
with respect to Vcr: 


TB (2.143) 


dic dic 
&ce = = , (2.144) 
dVcg dVcp 
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for constant VBE . Therefore, 
ce = . (2; 145) 
& V 


The emitter—base and base—collector p—n junctions also present junction capacitances 
which we have to take into account. The calculation of the junction capacitance can be 
reduced to the problem of calculating the width of the depletion layer w, because the 
capacitance is that of a parallel plate capacitor with area Ay and plate separation w: 


Aj 
Cy =é€s—. (2.146) 
w 
The depletion layer width is calculated from Poisson’s equation: 
d*® 
gf = 00) (2.147) 
dy? y 


where ® is the potential across the junction. A simple analytic solution can be found 
assuming that within the space charge region the mobile charge can be neglected com- 
pared to the fixed charge (i.e. the ionised donor density NB (y) on the n side and the 
ionised acceptor density N, (y) on the p side), and that within the depletion layer all 
doping atoms are ionised, while outside all doping atoms are neutral. The total potential 
difference across the depletion region must be equal to the sum of built-in (or diffusion) 
voltage Vp and the externally applied voltage Vext. 

For the simple case of homogeneous doping on both p and n sides (Np(y) = 
const, Na(y) = const), we obtain 


és NAtM 
ie fo i Veil. (2.148) 
q NaNp 


Note that V.x_ is defined as a reverse (depleting) voltage here. 

In active forward operation, the emitter—base diode is forward-biased (Vext = — Vg), 
while the base—collector diode is reverse-biased (Vext = Veg). In practical transistors, 
the emitter—base junction area Ag will also be different from the base—collector junction 
area Ac, so that we obtain for the n—p—n transistor: 


q:€s NapNps 1 
CpE = AR for Vgg < Vp (2.149) 
' 2 Nap+Np.5 Vp — VBE 
q:€s NaBNp.c 1 
Ccp =A ; 2.150 
oe </ 2 Na+ Np,c Vo-+ Vee : 


Finally, we have to account for the time lag associated with the transit of free charge 

carriers through the base—collector space charge region — the collector transit time.!° 
The calculation of the collector transit time is not as straightforward as it may seem, 

as it in principle needs both diffusive (at the base side edge of the space charge region) 


'0 Because the emitter—base diode is forward-biased, its depletion region is very thin and the associated transit 
time can be safely neglected. 
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Small-signal equivalent circuit of the intrinsic transistor. 


and drift transport components. The field-dependent velocity also needs to be taken into 
account, as well as the displacement current created by charge moving within the space 
charge region. 

A common assumption is that the carriers reach their drift saturation vsat instan- 
taneously after entering the space charge region. Then, the collector transit time 
Tc is 


Wc 


T= : (2.151) 
2 + Usat 


The small-signal components of the intrinsic transistors, which we considered above, 
can be combined in the small-signal equivalent circuit shown in Figure 2.66. @ is the 
small-signal common base current gain in the quasi-static limit. 

Figure 2.66 applies only to the intrinsic transistor and will have to be extended by 
extrinsic parasitic components at microwave frequencies. Most importantly, we have 
to account for the series resistances. In order to appreciate the problem, please refer 
to Figure 2.67, which presents a more realistic cross-section of the bipolar transistor, 
compared to Figure 2.58. 

The emitter resistance Rx (not to be confused with the dynamic emitter resistance re) 
is composed of the emitter contact resistance and the vertical resistance of the emit- 
ter layer, which in homojunction transistors typically is a poly-Si plug. The collector 
resistance Rc is formed by the collector contact resistance, the vertical resistance 
of the collector ‘sinker’ implant and the lateral resistance of the subcollector layer. 
The base resistance Rg finally is formed by the base contact resistance, the lateral 
resistance of the extrinsic base and the lateral resistance of the intrinsic base layer. 
Of these individual series resistance contributions, the lateral resistance of the intrin- 
sic base layer is the most problematic, as the thickness of this layer is given by the 
neutral base width Wg and has to be very thin to minimise the base transit time, 
(see Equation (2.143)). 

The series resistances have been added to the small-signal equivalent circuit in Figure 
2.68. From an application point of view, Rg and Rg are the most significant. 
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Fig. 2.67 


Fig. 2.68 
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A more realistic schematic cross-section of a typical bipolar transistor with planar contact 
arrangements. The dashed box indicates the intrinsic transistor — compare with Figure 2.58. 


Re 


E 


Small-signal equivalent circuit of the bipolar transistor, including the series resistances. 


Transit frequency. 

The total transit time through the bipolar transistor is calculated from the transit fre- 
quency fr, which is the frequency where the magnitude of the short-circuit current gain 
(h21 = ic/ip for Uce = 0) becomes one: tr = 1/(27 fr). 


tr = ta + tc + re (CBE + Cac) + (RE + Re) (Cag + Cac). (2.152) 
The total transit time can be separated as follows: 


e tp and 7¢ are intrinsic time constants which do not depend on the emitter current 
(neglecting the Kirk effect). 

e re (CBE + Cgc) is the intrinsic emitter charging time which is inversely proportional 
to the emitter current (see Equation (2.139)). 

e (Re + Rc) (Cpe + Cgc) is the parasitic charging time due to the emitter and collector 
series resistances, which is frequently neglected. 
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Schematic representation of the total transit time in a bipolar transistor as a function of the 
inverse emitter current. 


The effect of the intrinsic emitter charging time, which depends linearly on the emitter 
current (or, as @ * | in technical transistors, in good approximation on the collector 
current), leads to a strong bias dependence of the total transit time (and hence fr), 
which is shown in Figure 2.69. For Jz > Jg,opt, high-current effects such as the Kirk 
effect will again prolong the transit time. 

One of the key issues in designing high-speed bipolar circuits is therefore to choose 
the emitter current as close as possible to the optimum emitter current. 

Note that neither the base resistance nor the output conductance have an influence 
on the total transit time — this is an effect of the definition via fp and hence h2;. The 
definition of h2; assumes an ideal current source at the input and a short circuit at the 
output. g¢e has no effect as it is short-circuited (neglecting Rc here), and Rg is in series 
with an ideal current source and hence also has no effect. 


Maximum frequency of oscillation. 
As already discussed, the maximum frequency of oscillation fmax is a measure of 
the power gain cutoff frequency (fr measures only the current gain behaviour): the 
frequency where the MAG of a two-port becomes one. 

A common approximation of fmax for the bipolar transistor is 


| fr 
eS 2.1 
Simax Si RaCac (2.153) 


This equation is equivalent to the one introduced for FETs; see Equation (2.27) for very 
low output conductances and replacing Rg — Rp, Cop — Cpc. For an in-depth treat- 
ment, see M. B. Das [11]. As explained there, the simplification neglects the distributed 
nature of the base resistance (as we did in this introductory text) and is only valid 
if the emitter series resistance Rp and output conductance ge are sufficiently small. 
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In practice, however, Equation (2.153) is useful even for today’s HBTs with several 
hundred GHz fimax. 
Note that now Rg has a strong influence on the maximum frequency of oscillation. 


Microwave noise performance of bipolar transistors 


We will investigate the microwave noise performance of bipolar transistors using a 
simplified noise equivalent circuit (see Figure 2.70). 

The series resistance Rg and Rc will be neglected, as will be the output conductance 
&ce and the base—collector capacitance Cgc. This leaves three different noise sources to 
be included: 


(i) the thermal noise associated with the base resistance Rp: (|vnb|7)3 
(ii) the shot noise associated with the emitter—base p—n junction: (lvaal*)s 
(iii) the shot noise associated with the base—collector p—n junction: Wteel?), 


Due to the short base transit time, the emitter—base and base—collector shot noise sources 
are strongly correlated. 

The rationale for the omission of the collector resistance is that its contribution would 
be divided by the gain of the transistor; further the value of Rc is typically much smaller 
than Rg. The thermal noise source of the emitter resistance with a squared spectral 
voltage density of 8kT Rg'! would be in series with the shot noise source of the emitter 
current, whose spectral voltage density is 


(Ienel?) = 44 Ter? =4KkT re, (2.154) 


as re = kT /(q Ig). As long as re >> 2Rz, the thermal noise contribution of the emitter 
resistance can be neglected. Because low-noise bias points for bipolar transistors occur 
at small Jp, this can generally be assumed. 
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Strongly simplified T-type equivalent noise circuit of a bipolar transistor. 


'l Magnitude of the complex phasor. 
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Using the equivalent circuit in Figure 2.70, Hawkins [23] derived for the bipolar 
transistor: 


Rp + Ropt ao 


le |a(w)|?’ 


Fmin = a (2.155) 
where a(w) is the frequency-dependent common base current gain and ap its quasi- 
stationary value: 

ao 


MOST jlo 


with the base cutoff frequency @p = Ty "and Tp the base transit time Equation (2.143). 
The parameter a is 


=e 


The cutoff frequency we represents the emitter charging time: 


1 qglp 


ve CBEre ~ kT CBE_ 


Ropt 18 the real part of the noise-optimum generator impedance: 


2 2 a re(2RB +e) 
—— (s Ae e@e a 


and X op the imaginary part: 


ao Cpere 
ad 2 
la(@)|- a 


X opt = 


It is instructive to consider the quasi-static case, w — 0. In this case, 


1—ao 
az 
ao 
X opt = 
2 re (2Rp + re) 
Ropt = 8 + Aca 
We obtain therefore 
1 R Ri 2Rep+r 
Fmin(o > 0) = — + —* +f B+ (1-0), (2.156) 
ao = Bore (Bo re) le 


where fo = ao /(1 — ao) is the common-emitter small-signal current gain. 
We conclude that for @ — 0, the minimum noise figure does not converge towards 
1, as in FETs, (see e.g. Equation (2.29) for the MESFET), but a higher value which 
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depends on the current gain and the base resistance. If we further assume small 
Rp/(Bo re), Equation (2.156) reduces to 


1 2Rp +r 
Fmin(o > 0) ® — + fo =) 
oA) Te 


It is obvious that the current gain has a very important influence on the noise 
performance of a bipolar transistor. 

Let us now investigate a medium frequency range @ «K w < wp. To simplify 
matters, we assume an ideal current gain a = |. In this case, 


ea aly (2.157) 


We find that in this case the increase with frequency is determined by the base resistance 
Rp, which is therefore a very important parameter for the microwave noise behaviour 
of bipolar transistors. 

Equation (2.155) contains an implicit bias dependence via re = Vr/IJp, where 
Vr = kT/q, as usual. re also determines fe. As long as Ig >> Vrw Cpe, Fmin 
will increase proportionally with increasing 7g. For very small Jz, however Fmin will 
increase inversely proportional to Jz. We note that here will be an optimum emitter cur- 
rent with respect to the noise performance. This current is usually much lower than the 
current required for optimum fr (Figure 2.69) and hence requires a trade-off between 
device speed and noise in circuit design. Figure 2.71 shows an example calculation. 
We note that the optimum emitter current is frequency-dependent and moves to higher 
currents with increasing frequency. 

An important noise parameter not considered in Hawkins’ theory is the equivalent 
noise resistance. It determines the sensitivity of the noise figure on deviations from the 
noise-optimum generator impedance. Therefore, a small R, facilitates circuit design as 
it makes exact noise match less critical (Section 5.3). Using Hawkins’ equivalent circuit, 
an expression for Ry, was introduced by Pucel and Rohde: 


wom (o-a)eslor(B) [-me(h) 
SG) os 
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Example calculation of bipolar noise figure dependence on the emitter current. Parameters 
chosen are: ff) = 50 GHz, Cpr = 0.2 pF, ag = 0.99 and Rg = 10 Q. 


The newly introduced parameter D is 


o=a['+(5)] 


We note the importance of a low base resistance to achieve a small Rp. 


Transit time optimisation 
Drift field in the base 


We had explicitly assumed that charge carriers traverse the base by diffusion only, that 
any electric field in the neutral base can be neglected. This is reasonably true provided 
that the base layer is highly and uniformly doped. 

Any significant variation in doping concentration will lead to a built-in electric field 
which will either enhance or impede the carrier transport in the base. Advantageously, 
we make the base doping concentration higher at the base-emitter junction than at the 
base—collector junction, introducing an accelerating field for charge carriers travelling 
from emitter to collector. 

Figure 2.72 shows the schematic band diagram of such a structure. The conduction 
and valence bands in the neutral base are titled due to the doping variation, adding a 
drift field force acting upon both electrons and holes. A constant electric field results if 
the doping concentration is exponentially varied: 


NaB(y) ~e *%. 
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Band diagram of a bipolar transistor with variation of the doping concentration in the base, 
creating a drift field. 


If the base doping is adjusted using ion implantation from the emitter side, as is com- 
monly the case in today’s bipolar technologies, a suitable doping profile automatically 
results. 

The built-in field can be easily calculated provided that the Boltzmann approximation 
is assumed to be valid: 
kT In NA.B.max 


2qWp Napmin 


€y,bi = 


The base transit time under the influence of this built-in field is then [65]: 


t= (2.159) 


Even modest variations of the base doping concentrations can result in substantial 
reductions in base transit time. 


Collector transit time optimisation 

The collector transit time Equation (2.151) can become a significant part of the 
total transit time, especially in devices with high breakdown voltages. Optimising the 
collector design involves important design compromises: 


e For high fr, the device needs to be driven to high collector currents, minimising the 
emitter charging time constant. Therefore, the Kirk effect needs to be pushed to higher 
currents, demanding a larger collector doping concentration. Equally, the collector 
transit time needs to be reduced by reducing the depleted collector width Wc. For 
a given collector—base voltage, this agrees with the demanded increase in collector 
doping concentration. 

e However, a high maximum frequency of oscillation needs a low Cgc which, 
for a given collector—base voltage, demands a decrease in the collector doping 
concentration. 
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e Equally, an increase in collector—base breakdown voltage needs a decrease in collector 
doping concentration and a larger Wc. 


The link between breakdown voltage and transit frequency is frequently expressed in 
terms of the Johnson limit [17]. To derive the Johnson limit, let us assume that the 
collector transit time fully dominates the total transit time. Using Equation (2.151), we 
find 
1 Usat Usat 
~~ = >Wo= ; 

fr 2m tT aWe . x ft 
If Ecrit 18 the critical field for breakdown and if we assume homogeneous doping in the 
collector, the collector—base breakdown voltage (open emitter terminal) is 

Wc 


BVcgo = Ecrit 3° 


The product of transit frequency and breakdown voltage will then only depend on the 
material properties Vzg, and BVcgo: 


Ecrit Usat 
fr: BVcBao = ——. 


The collector-emitter breakdown voltage BVcro is lower than BVcgo because the 

impact ionisation current is amplified by the current gain B when entering the base: 

BVcgo 
vB 


where m is a parameter which depends on the exact geometry and doping of the 


BVcrEo = (2.160) 


transistor. 
We find for the Johnson limit: 
Ecrit sat 


Qn */B 


For Si, Ecrit © 5 - 10° Vcm7!. The drift saturation velocity at room temperature is 
Vsat © 107 cms7!. Assuming a typical B = 250 and m = 4, we find fr - BVcgo = 
200 GHz. This is the frequently quoted ‘Johnson Limit’ for silicon bipolar devices. We 
readily recognise from Equation (2.161) that it is not a constant and can be significantly 
different for other values of B and m. 


fr: BVceo = (2.161) 


Heterojunction bipolar transistors 


The base design dilemma 
In our discussion of homojunction bipolar transistors, three main parameters with 
crucial impact on the high frequency performance were identified: 


(i) the base transit time which sets the ‘intrinsic speed’ of the transistors; 
(ii) the base resistance which affects the maximum frequency of oscillation and the 
noise performance; 
(iii) the current gain which not only influences the noise performance, but also has to 
be typically >100 to simplify circuit design. 
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The dilemma is that the base design parameters Wg, Na.p influence these parameters 
in different ways: 


e Tt ~ We and increases, albeit weakly, with increasing Na due to the reduction in the 
minority carrier mobility; 
e Rp ~ <1 — 


A transistor with high fr and fmax would therefore have a thin, highly doped base layer. 
However, the high base doping concentration will decrease the current gain if the emitter 
doping concentration cannot be proportionally increased. 

On the other hand, there are limits to the increase in emitter doping. The main limi- 
tation is bandgap narrowing. With increasing doping concentration, the band gap in the 
emitter will decrease. In silicon, 


(2.162) 


Np  300K\°° 
10'!8cm-3 T , 


AEg ® 22.5 ( 


The decrease in band gap in the emitter will increase the intrinsic carrier concentration 
there (njwe—£6/2kT) which in turn lowers the current gain because the base current due 
to injection of holes from the base into the emitter is 

2 
Neg 
Np,E 


Jp~ 


We therefore conclude that the base doping concentration cannot be be strongly 
increased while keeping a high current gain. Therefore, thin-base microwave bipolar 
transistors have a problem with rather high base resistances. 

Two approaches can be taken to solve the base design dilemma: 


(i) We can increase the carrier velocity in the base so that a target transit frequency 
can be met with a larger base width Wg. This has been discussed already in the 
context of doping variations in the base layer; we will see further down that the 
effect can be achieved much more elegantly using bandgap variations. 

(ii) We can search for a way to increase the base doping concentration while 
maintaining a sufficiently high current gain. This we will discuss first. 


The wide-gap emitter 

The first approach to solving the base design dilemma had already been discussed when 
deriving the equation for the maximum current gain, Equation (2.121). It was noted that 
it would be advantageous to fabricate the emitter from a material with a lower intrinsic 
carrier concentration, nj,2. Because 


ni=v NcNyelo/ kt) | 
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the bandgap in the emitter needs to be increased. Then the maximum current gain 
becomes 


2 
Jo _ Ineb — DnB We Npe "in NDE pAEG/kT 


_ — 2 (2.163) 
JB Jpbe = Dp We Nasri, = NAB 


Bmax = 


with AEg = Eg 5 — Eg 8, neglecting secondary effects like the differences in effective 
densities of states in the two materials. 
The enhancement factor e4£/*" can have very large values. Consider as an example: 


Emitter: Alp.25Gao.75As with Eg = 1.74eV 
Base: GaAs with Eg = 1.42eV 
This results in e4£6/4T — 2.9 x 109 at room temperature! 


In a technical transistor, this current gain enhancement is traded for a dramatic increase 
in base doping with a simultaneous decrease in emitter doping. For a Si homojunc- 
tion transistor, a typical doping combination is Np Etyp = [0 cm-?,..N, A.B.typ = 
10!8 cm~3, whereas for an AlGaAs/GaAs HBT, Np,E,typ = 5 X 10!7 cm73, Na.B,typ = 
4 x 10!°cm73. The reduction in emitter doping in the HBT is necessary to maintain 
an adequate reverse breakdown voltage of the base-emitter junction — the effect of cur- 
rent gain on the emitter—collector breakdown voltage B Vcgo was already discussed (see 
Equation (2.160)). 

Figure 2.73 shows the band diagram of a wide-gap emitter transistor with a graded 
emitter—base heterostructure. In a graded heterostructure, the two materials used for 
the emitter and the base are allowed to intermix over a certain distance. We note that 
the injection of holes from the base into the emitter, which constitutes a major part of the 
base current, now faces a much larger potential wall than the injection of electrons from 
the emitter into the base. This provides the intuitive explanation for the potentially huge 
increase in current gain. 

In an abrupt heterostructure, such as considered for the HEMT (Figure 2.17), we need 
to consider the effect of the conduction band and valence band discontinuities resulting 


E B Cc 


Band diagram of a wide-gap emitter HBT under bias (Vgp > 0, Vop > 0) with graded 
heterojunction. 
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Band diagram of an abrupt heterojunction wide-gap emitter HBT under bias 
(VBE > 0, Vcp > 0). 


Introduction of a drift field in the base using bandgap variation. 


from Anderson’s rule (Section 1.20.1). The conduction band discontinuity will lead to 
an additional energy barrier for electrons (see Figure 2.74). 

Assuming purely thermionic emission of electrons over the conduction band barrier, 
we can write in first order for the maximum current gain of the abrupt HBT, compare 
Equation (2.163): 


~ —AEc/kT 
Bmax,abrupt oa Bmax, graded e ch (2.164) 
tes NDE eAEv/kT 
Na.B 
The enhancement factor in this case is only related to the part of AEg which occurs in 


the valence band. In the above example, Alo.25Gao.75As/GaAs, the enhancement factor 
is now only 107, because AEy = 0.38 AEc. 


Drift base 


Using compositional grading in the base, we can also introduce a drift field in the base, 
as shown in Figure 2.75. The emitter has now the same band gap Eg, as the base imme- 
diately adjacent to the emitter—base junction. The band gap is then reduced towards the 
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collector to Eg.2 < Eg. The resulting bandgap difference AEG reduces the base 
transit time [9]. 


) 1 — e-AEG/kT 
TB, graded = TB,ungraded AEG/kT 1 AEG : (2.165) 


The bandgap reduction is very efficient in reducing the transit time — a modest AEg = 
4kT results in a 62% reduction of the transit time. 


HBT implementations 
Group IlI-V HBTs. 
HBTs fabricated from group IN—V materials such as AlGaAs/GaAs, GalInP/GaAs or 
InP/InGaAs typically have multiple-mesa structures such as the cross-section shown in 
Figure 2.76. Due to its cross-sectional shape, it is frequently referred to as a wedding 
cake structure. The structure can be fabricated with a minimum number of masks; the 
base contacts are usually self-aligned to the emitter mesa using a deliberate undercut of 
the emitter contact. 

While cost-effective in production, this structure has three major drawbacks: 


(i) The topology is strongly non-planar. This makes realisations of sub-micron lateral 
feature sizes difficult, as well as the implementation of multi-level interconnect 
systems. 

(ii) The necessary area for the base contacts and allowances for alignment accuracy 
necessarily lead to a base—collector area which is substantially larger than the 
base-emitter junction area. This leads to a larger-than-necessary base—collector 
capacitance Cgc, which in turn lowers the maximum frequency of oscillation (see 
Equation (2.153)). 

(ii) The base-emitter junction is not embedded in semiconductor material, but reaches 
the less-than-ideal interface with the passivation layer. This gives rise to enhanced 
surface recombination currents, which increase the non-ideal portion of the base 
current (see Equation (2.122)). As a consequence, III-V HBTs have a current gain 
which is strongly dependent on the collector current. 


N wide-gap emitter 


p* base 


n collector 


n* subcollector 


Semi-insulating substrate 


Generic HBT structure typical of II—V semiconductor materials. 
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Gummel plot of a typical AlGaAs/GaAs HBT. 


Figure 2.77 shows the Gummel plot representation of the collector and base currents 
of a typical AlGaAs/GaAs HBT. The Gummel plot displays the currents in a semi- 
logarithmic way as a function of Vg, for Vgc = 0. Ideally, base current and collector 
current both have an emission factor (ideality factor) of 1 (see Equation (2.121)). This 
would result in perfectly parallel curves for log Jc and log Jp, which is not the case 
here. The non-ideal base currents, with their emission factor >1, are seen predomi- 
nantly for very low base-emitter voltages and reduce the current gain there. This is 
a problem especially for low-noise operation. Furthermore, the surface recombination 
currents give rise to low-frequency noise (1/f or generation-recombination type noise), 
with negative impact e.g. on the phase noise of microwave oscillators. 

For the wide-gap emitter, AlGaAs was long the material of choice. It was more 
recently largely replaced with Ga,_,In,P, which for an In mole fraction of 0.5 is 
lattice-matched to GaAs. GalnP as the emitter material has several advantages: 


e The bandgap difference at the GalnP/GaAs junction occurs predominantly in the 
valence band. 

e GalnP can be selectively etched with respect to GaAs, allowing for an automatic etch 
stop on the base layer when structuring the emitter mesa. 

e The reliability of the base-emitter junction under current stress was shown to be 
substantially higher. 


For optoelectronic integration and millimetre-wave applications, HBTs are also being 
fabricated in InP substrates. The base is now Ino,53Gag,47As. As was discussed for the 
pseudomorphic HEMT structure, the electron mobility is substantially higher than for 
GaAs, leading to a much shorter base transit time (see Equation (2.143)). The hole 
mobility in InGaAs, however, is lower than in GaAs, leading to an increased base 
resistance. The emitter material is either InP or Ing.52Alo.4g As. 

HBTs with InGaAs base and collector regions have a major problem with low 
collector—base breakdown voltages, because the lower band gap of InGaAs lowers 
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Band diagram of a DHBT under active forward bias. 


the threshold for impact ionisation. Therefore, double-heterojunction bipolar transis- 
tors (DHBTs) are frequently used in this material system, with either InAlAs or InP 
as the collector material. DHBTs, however, introduce another problem, which is illus- 
trated in Figure 2.78. The heterostructure at the base—collector interface introduced 
additional energy barriers in the conduction and valence bands. The conduction band 
barrier impedes the collection of electrons and lowers the current gain. 

The valence band barrier is important especially after the onset of the Kirk effect 
(see Section ‘Kirk effect’). After all fixed donors in the collector have been neutralised, 
charge neutrality requires that with a further increase in current, free holes from the 
base are injected into the collector region. Due to the valence band barrier, however, 
this is restricted in the DHBT. As a result, holes accumulate at the base—collector inter- 
face, the bands bend upwards, and the electron barrier in the conduction band becomes 
higher. This leads to a much more severe deterioration of transistor parameters in the 
high-current regime. The problem can be avoided either by compositionally grading the 
base—collector junction, or by introducing a composite collector structure where the het- 
erojunction is offset away from the base—collector p—n junction into the collector [18]. 

Another problem related to the collector region is the aforementioned substantial 
Cpc due to the triple mesa structure of II—-V HBTs (see Figure 2.76). One solution is 
to fabricate the subcollector in a buried fashion by ion implantation beneath a semi- 
insulating layer, and to connect it to the collector contacts and to the collector proper 
via ‘sinker’ implants. 

An example for such a structure is shown in Figure 2.79 [44]. The subcollector 
is implanted into the semi-insulating InP substrate; the layer above has a drastically 
increased conductivity due to an Fe implant. Heavily n-doped local implants connect the 
buried subcollector to the collector contacts and to the collector itself, which is grown 
together with the base and emitter layers subsequently. While the base—collector area is 
not changed here, Cgc is still reduced because the collector layer itself is depleted in 
normal operation and therefore the reduced collector—subcollector interface area dimin- 
ishes the capacitance. Further, Cpc is less Vcg-dependent, which enhances linearity. 
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InP/InGaAs HBT structure with a buried subcollector. 


Finally, burying the subcollector improves the planarity of the structure. In the example 
shown, the InP/InGaAs DHBT demonstrated fr = 350 GHz and finax = 400 GHz. 

Another method to restrict Cgc is by damage implant through the base contact 
window prior to metal deposition. 

Cgc can be further reduced by eliminating the subcollector altogether and attaching 
the contact directly to the collector layer. To achieve this, the HBT structure must be 
inverted, i.e. the emitter contact is now at the bottom [50]. The collector must be accu- 
rately aligned to the buried emitter structure. A new problem which arises in classical 
collector-up HBTs is that now the emitter contact must be made laterally through a 
“sub-emitter’ layer, increasing the crucial emitter series resistance. 

The latter problem is addressed in the very aggressive ‘transferred substrate’ device, 
where the HBT structure is grown emitter-up. The emitter and base/collector structures 
are fabricated first. The structure is then flipped around and the emitter is attached to 
a Au metal structure which provides for the low-resistivity lateral emitter contact. The 
InP substrate is then removed and the collector contact is structured. The collector layer 
outside of the contact area is fully depleted and does not add extra capacitance. 

A schematic cross-section is shown in Figure 2.80 [47]. Together with submicron 
scaling (0.4 4m-6 jm emitter area, 0.7 |pm-10|1m collector area), an InP/InGaAs HBT 
with a transferred-substrate structure exhibited fr = 204 GHz and fmax = 1080 GHz. 


Si/SiGe HBTs. 

Unlike II-V HBTs, which are usually fabricated from lattice-matched heterostructures, 
HBTs in the Sij_,Gex material system are necessarily pseudomorphic (Section 1.20), 
which delayed their practical realisation until the late 1980s. They are commercially 
available since 1998 and have enjoyed an unparalleled technical and commercial 
success. 

Due to the large difference in lattice constant between Si (a = 5.43 A) and Ge (a = 
5.66 A), an elastically strained SiGe layer will necessarily be very thin, as was shown 
in Figure 1.35. The use of SiGe compounds is therefore restricted to the base layer — 
everything else is silicon, making Si/SiGe transistors necessarily DHBTs. 

The SiGe alloy can be used in two different ways: 


(i) The base may start with a zero Ge mole fraction at the emitter—base junction, and 
be increased towards the base—collector junction. The corresponding decrease in 
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Schematic cross-section of a transferred-substrate collector-up HBT [47]. 


(ii) 


band gap creates a built-in field for electrons in n—p—n transistors — a drift-base 
transistor results, with a band diagram similar to the one shown in Figure 2.75, 
except that the base—collector interface is now a hetero-interface. Pseudomorphic 
SiGe layers sandwiched between relaxed Si layers have an interesting property: 
the bandgap difference is almost exclusively in the valence band (see Figure 1.35). 
Hence, there is no parasitic conduction band barrier, at least not until high-current 
effects set in and the hole pile-up against the base—collector valence band barrier 
makes the bands buckle upwards. 

The major benefit of the built-in drift field is the reduction in base transit time 
given by Equation (2.165). Due to the emitter—base interface being a homojunc- 
tion, it is bound by similar base doping limitations as the homojunction bipolar 
transistor. 

This Si/SiGe drift-base concept has the significant advantage that the average 

Ge mole fraction in the base, and with it the built-in mechanical strain, is very low. 
In terms of processing temperatures, these transistors are fully CMOS-compatible. 
The drift-base heterostructure transistor is therefore the approach of choice in most 
Si/SiGe BiCMOS processes. 
Si/SiGe heterostructures can, of course, also be used to fabricate a wide-gap emitter 
structure. In this case, the Ge mole fraction is already significant at the emitter— 
base junction, leading to a significant valence band discontinuity, which allows 
to dramatically increase the base doping concentration (see Equation (2.164)). In 
these transistors, the Ge mole fraction is typically constant across the base. 

The major benefit of the wide-gap emitter structure is the high base doping 
concentration and resulting low base sheet resistance, which allows to achieve 
high cutoff frequencies despite very relaxed lateral scaling rules, e.g. ft, fmax = 
80 GHz with 0.8 pm design rules [55] . 


The two approaches may be combined, of course — the Ge mole fraction profile may 
start with a moderate non-zero value at the emitter—base interface and increase towards 
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the collector to a higher value at the base—collector junction, combining a hole-blocking 
effect towards the emitter with a built-in drift field towards the collector. This is called 
a trapezoidal Germanium profile in the base. The current gain in this case is [26]: 


Eg(y = 0) — Ec(y = Wa) gore niet 
kT 1 — e-lEa(y=0)—Eo(y=We)/kT ]’ 


BsiGe = Bsiny (2.166) 
where Bg; is the current gain of a homojunction transistor with the same geometry, 77 is 
the ratio of the position-averaged minority mobilities in the base of the two transistors, 
and y is the position-averaged ratio of the density of states product (Ny - Nc) across the 
base. The emitter—base junction is at y = 0, and the base—collector interface at y = Wg. 

Since 

lim ——— = 1, 

x>0 1—e-* 
Equation (2.166) reverts to Equation (2.163) for Eg(y = 0) = Eg(y = Ws). On the 
other hand, we see that having a pure drift-base profile (Eg(y = 0) = 0) also results in 
a certain increase in the current gain. 

Irrespective of the Ge profile in the base, a major advantage of the Si/SiGe HBTs is 
that they can harness the full potential of silicon technology, especially aggressive lateral 
scaling developed predominantly for CMOS process, different isolation techniques, and 
SiO? as a highly stable native oxide. 

A typical SiGe HBT in a commercially available technology has a structure similar to 
the schematic in Figure 2.81. Note the very planar structure compared to III-V HBTs, 
and the extensive use of SiO isolation. The n* subcollector is created by ion implanta- 
tion, after which a low n-doped Si layer is epitaxially grown and converted to SiO2 by 
local oxidisation, except in the areas below the collector contact and where the transis- 
tor structure will be. The collector area is doped using selective ion implantation, which 
allows for several collector doping concentrations on one chip, with different fp ver- 
sus BVcgo trade-offs. The transistor structure is then grown selectively in the transistor 
window. 


n* poly-Si emitter 


p* SiGe intrinsic base 


Silicide base links 


NNNANNNAANNS: 


et ee n* subcollector and collector ‘sinker’ 
p* -implanted extrinsic base 


p Si substrate 


Selectively implanted collector area 


Planar Si/SiGe HBT with implanted extrinsic base region. 
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Planar Si/SiGe HBT with a raised extrinsic base structure. 


The extrinsic base resistance is reduced by heavy pt implantation. This works well if 
the transistor is not aggressively scaled laterally. It does create, however, crystal faults 
immediately adjacent to the intrinsic base, which leads to enhanced diffusion of the 
p-dopant in the base and is a major obstacle to fabricating deep submicron lateral emitter 
widths. Additionally, the close proximity of the p* extrinsic base and the selectively 
implanted collector increases the base—collector capacitance. 

The latter problems are solved using a raised base structure, where a p* extrinsic layer 
is grown selectively on top of the base adjacent to the emitter, as shown schematically 
in Figure 2.82 [14]. A combination of these techniques with deep submicron scaling led 
production Si/SiGe HBT technologies to achieve fr and fmax values above 200 GHz. 


III-V versus Si/SiGe HBTs — a brief comparison. 
Si/SiGe HBTs displaced II—V HBTs in many applications primarily due to their 
supreme potential for large-scale integration, owing to their technological proximity 
to very mature Si processes. In terms of raw speed, as measured from fr and finax, 
record values are still scored by InP/InGaAs devices, but Si/SiGe HBTs are compet- 
itive, because they compensate for material deficiencies (e.g. the much lower electron 
mobility versus InGaAs), by aggressive lateral scaling and superior suppression of para- 
sitic capacitances. Further, Si has a significantly higher thermal conductivity than either 
GaAs or InP, which makes the thermal management of dense transistor arrays easier. 

In the area of microwave power amplification, however, III—V-based HBTs have an 
inherent advantage. When deriving the Johnson limit, Equation (2.161), we recognised 
the importance of the product of drift saturation velocity vst and the electrical field 
necessary for impact ionisation E¢,it. Taking vsat at an electric field of 1OkV cm7! 
Esat at a donor doping concentration of 10!7 cm7?, this product is shown for Si, GaAs 
and InP in Table 2.1. 

When comparing practical transistors, the ratio in fr B Vcgo between Si- and GaAs- 
based HBTs may appear even larger than the factor of 1.6 suggested by Table 2.1; but 


, and 


More free ebooks : http://fast-file.blogspot.com 


2.5.6 


Electronic devices 147 


Table 2.1 vsat - Ecyi¢ Product for Si, GaAs 


and InP 
Si 5,000 GHz V 
GaAs 8,000 GHz V 


InP 22,000 GHz V 


this is due to the generally lower current gain in the GaAs devices, which increases 
BVcEro. 


Large-signal modelling 


Bipolar transistor models have become increasingly complex. An exhaustive description 
of popular large-signal formulations is beyond the scope of this book. The following will 
concentrate on emphasising the major differences between the models, with respect to 
active forward operation of the transistor, quasi-static non-linear equations and avoiding 
extreme areas of operation. 


The Ebers—Moll model 

The Ebers—Moll equivalent circuit model was historically the first compact model of 
the bipolar transistor [16]. It approximates the intrinsic transistor as a network of 
two junction diodes and two current-controlled current sources (see Figure 2.83(a)). 
The parameter Ap is the common-base current gain in forward operation. Ap is the 
common-base current gain in reverse operation (emitter and base interchanged), which 
is not being considered here. For active forward operation, the base—collector diode is 
reverse-biased. Further, Ar Jc is much smaller than the forward current through the 
base-emitter diode and can hence be neglected. The resulting simplified equivalent 
circuit is shown in Figure 2.83(b). 

The emitter current in active forward operation is 


js = Hlape (c¥b/ (Ne Vr) _ 1) (2.167) 


where Jspp is the base-emitter saturation current, Ng the base-emitter ideality factor 
and Vr = kT/q the thermal voltage. 

Using the full equivalent circuit, the Ebers—Moll equivalent circuit can account for 
saturation (both diodes are forward-biased), but cannot model Early and Kirk effects. 
Further, the current dependence of the current gain at low Vpg can also not be included. 


The Gummel-Poon model 
An improved model of the bipolar transistor which is capable of including more of 
the non-ideal effects of bipolar transistors was introduced by Gummel and Poon in 
1970 [22]. 

The equivalent circuit (Figure 2.84) uses a voltage-controlled current source for the 
collector current — the transistor is seen in common-emitter configuration here. The 
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Fig. 2.83 Quasi-static Ebers—Moll equivalent circuit of the intrinsic bipolar transistor: (a) for forward and 


reverse operation; (b) simplified for forward active operation only. 
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Fig. 2.84 Gummel-—Poon quasi-static equivalent circuit of the intrinsic bipolar transistor. 


use of two parallel diodes for the base-emitter and base—collector junctions allows to 
include both the ideal (/gz, Jpc) and non-ideal (/;gp, /;3c) current contributions in for- 
ward and reverse operations of the transistor, with different emission factors. Hence, the 
current gain reduction at low Vgg (or Vgc in reverse operation) can be easily included. 
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The collector current formulation (shown here for forward operation only) uses a 
saturation current Js and a charge control parameter Op: 


lex a femmeiite Vr) _ @Vec/(NR i) , (2.168) 
B 


where NF is the ideality factor in forward direction and Np the ideality factor in reverse 
direction. Early and Kirk effects are modelled through the charge control parameter: 


On=S(14 1+402) 


— IS (Vee /(Ne Ve) ) 
22 == (e 4 \i (2.169) 


where Var is the Early voltage in forward direction and /KF is the knee current for the 
onset of high-current effects in the forward direction. 

In the active forward regime, Jgc and /-gc can be neglected and the base current 
becomes: 


Tg = Ipe + [BE 


~ (evse/(Ne Vr) 1) he (cvse/(Ne Vr) _ 1), (2.170) 


where BF is the ideal forward current gain, /sp the saturation current of the non-ideal 
base current and Ng the emission factor of the non-ideal base current. 

Extension of the equivalent circuit to the dynamic case is shown in Figure 2.85. In 
active forward operation, Cpg contains both the diffusion capacitance Equation (2.141) 
and the junction capacitance of the base-emitter junction, while Cgc is a junction 
capacitance only. The capacitance Ccs models the reverse-biased junction between the 
(sub-)collector region and the substrate node. On semi-insulating substrates, it is not 
necessary. 

The Gummel—Poon model also deals with the bias dependence of the base resistance 
which is frequently observed — Rp decreases from a higher value at low collector current 
to a much lower value at high collector current. This effect is due to a concentration 
of the emitter current towards the emitter periphery with increasing current — due to 
the lateral voltage drop in the base layer, the local base-emitter voltage is higher and 
closer to the base contact. As the local current depends exponentially on the local Vgr, 
even a small voltage change can lead to substantial redistributions in current. The base 
resistance decreases because the inner parts of the emitter—base area get increasingly 
detached. Due to the much lower base sheet resistance, this effect is less pronounced 
in wide-gap emitter HBTs. The Gummel—Poon model describes this effect using the 
parameters RB, RBM and IRB: 
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Gummel—Poon equivalent circuit with parasitic elements and substrate node. 


tan(z) — z 
z- tan2(z) 


Rp(p) = RBM + 3 (RB — RBM) 


(2.171) 


In total, the Gummel—Poon model implemented in SPICE contains 42 model parame- 
ters. A full discussion is therefore beyond the scope of this book. 


The VBIC95 model 
The VBIC95 model [37] is an extension of the Gummel—Poon model. Among others, 
the following problems are being addressed: 


e The description of base width modulation using a constant Early voltage is a 
simplification which only applies to small Vcr. 

e Self-thermal effects are not included in Gummel—Poon, yet play an important role 
especially for power amplifiers. 

e The collector resistance is not a constant, but depends on Vcp, because the undepleted 
part of the collector increases the series resistance. 

e Avalanche breakdown in the collector space charge region needs to be included. 
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Generic n—p-n BJT cross-section highlighting the parasitic p—n—p transistor. 
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VBIC9S5 equivalent circuit. 


e A major addition has been the implementation of a subcircuit for the parasitic p—n—p 
transistor, which is formed in Si-based bipolar transistors between the base, the col- 
lector and the substrate. This parasitic p—n—p can be easily recognised in Figure 2.86. 
Under certain bias conditions, it may sink an unexpectedly large current between the 
base terminal and the substrate node. 


Figure 2.87 shows the VBIC95 in a representation which emphasises its Gummel—Poon 
heritage. The distributed nature of the base resistance is accounted for. The collector 
resistance is now separated into a bias-dependent part which symbolises the contact, 
sinker and subcollector resistances, and a bias-dependent internal part modelling the 
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undepleted part of the collector proper. The substrate network is now much more com- 
plex and includes the parasitic p—n—p as a separate Gummel—Poon type equivalent 
circuit. Additional capacitances Cggo and Cgco have been added to account for over- 
lap capacitances between the poly-Si emitter plug and the base and collector contacts, 
respectively. 

Note that these are the only linear capacitances — all other capacitances are bias- 
dependent, even though this has not been noted in the equivalent circuit to enhance 
readability. 

The VBIC95 model implemented in newer versions of SPICE has 85 parameters, 
which also hints at the complexity of setting up such a model from measurements. 


The MEXTRAM model 
The MEXTRAM bipolar transistor model was created by Philips N. V. [42] and released 
into the public domain in 1993. It has been implemented in several industry standard 
simulation environments, such as several versions of SPICE and Agilent Advanced 
Design System (ADS). 

The equivalent circuit (Figure 2.88) shows stronger deviations from the Gummel-— 
Poon topology. The main current equation, however, shows the similarity: 


I 
Re (cVore/ VE eV aca!) (2.172) 
qb 
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Fig. 2.88 MEXTRAM model equivalent circuit topology. 
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Puiss 


Thermal equivalent circuit used by VBIC95 and MEXTRAM. 


Here, Vg2~; 1s the voltage between nodes B2 and El, while VB2C2 is a calculated 
quantity which corresponds to the voltage drop between nodes B2 and C2 — for an expla- 
nation of this and other intricacies, please refer to the MEXTRAM documentation [42]. 
qb is the normalised base charge, which is used to model both Early and high-current 
effects. This is conceptually as in Gummel—Poon, but the implemented equations pro- 
vide a higher level of accuracy, for example in the bias dependence of the Early voltage. 

The base is modelled as a distributed structure — this is a must for accurate modelling 
at elevated frequencies. The base—collector capacitance is split into three partial capaci- 
tances. The model does not only distinguish between an external and an internal part of 
the base, but models the sidewall interface between base and emitter separately Cen and 
OF.) Two diodes are used to model the ideal and non-ideal parts of the base current. 

The parasitic p—n-p transistor is also modelled here, even though this is less obvious — 
the current source between nodes C1 and S is exponentially controlled by the intrinsic 
base—collector voltage, Vgici. The reverse behaviour of the parasitic p-n—p, however, 
is not modelled. 

The major claimed advantage over VBIC95 is related to the modelling of high-current 
effects [30]. This is especially important for double-heterostructure transistors, such as 
Si/SiGe HBTs (see Figure 2.78 and its associated discussion). 

MEXTRAM has also been extended to account for neutral base recombination and 
base drift fields introduced through bandgap variations, again in an effort to make this 
model especially useful for Si/SiGe HBTs. 

Self-thermal effects are being simulated in VBIC95 and MEXTRAM in the same 
way (see Figure 2.89). The model calculates the sum of all powers dissipated in resis- 
tors and space charge regions as Pgiss. In the electric equivalent circuit, Pyiss is treated 
as a current which creates a voltage drop dT of the parallel connection Rin, Cin, which 
establishes the thermal time constant ti;. dT is analogous to the temperature difference 
between the device (taken to be at one single temperature — a simplification) and the 
ambient. It is then used as an additional control voltage for the bias-dependent cur- 
rent sources. This is a very common technique to include self-thermal effects, but it 
neglects the fact that the thermal conductivity of semiconductors, and with it the ther- 
mal resistance Rip, is temperature-dependent. With increasing temperature 7, Rip will 
also increase. 


The HICUM model 


The acronym of the last model to be discussed here already indicates its major claimed 
advantage — HICUM [51] stands for High-Current Model. Aside from being a general 
purpose non-linear bipolar model, with special emphasis on high-speed applications, it 
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HICUM equivalent circuit (adapted from [52]). 


concentrates especially on an accurate prediction of high-current effect. Remember that 
high-speed bipolar operation will occur at high collector current densities, minimising 
the emitter charging time (see Figure 2.69). Accurate assessment of high-current effects 
is therefore a must for any simulator with high-speed emphasis. HICUM’s development 
started in 1980 and it has been implemented in commercial simulation environments 
since 1994. Its model equations take a semi-physical approach to allow scalability and 
a certain degree of prediction. A companion program, TRADICA, facilitates the latter 
two issues. ! 

Figure 2.90 shows the equivalent circuit used in HICUM. The modelling of self- 
thermal effects is done analogous to Figure 2.89 and is not shown here again. 

The HICUM model does not have an equivalent element to the Gummel—Poon ideal 
current gain BF, but calculates the collector and base currents independently and treats 
the current gain as a derived entity. The main current in the HICUM model is the 
transfer current /rm, which can be compared to the intrinsic collector current Jc (see 
Equation (2.168)): 


eVBiEI/VT _ oVeici/ Vr 


Qp.t 
Qpo 


= Is (2.173) 


where Qp0 is the total hole charge in the base at zero bias. Note that unlike the Gummel— 
Poon expression, the exponential function does not have ideality factors. The deviation 
from non-ideal diode characteristics is handled in the bias-dependent hole charge Q,,T. 


2 An introduction to TRADICA is available at www.iee.et.tu-dresden.de/Schroter/Trad/features. pdf. 
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The formulation for Qp 7 in HICUM allows to include the effect of strongly varying 
intrinsic carrier densities across the base layer, as necessary for the simulation of drift- 
base HBTs [53]. In the model, this is done through different weighting factors being 
applied to the depletion charges at the base-emitter and base—collector junctions. 

The model can also accommodate the hole accumulation at the base—collector hetero- 
junction, with current gain roll-off and transit time deterioration, as needed in DHBTs 
[54]. For an in-depth treatment of HICUM parameters, refer to [52]. 


Differences between BJTs and HBTs relevant to large-signal modelling 
In general, the standard bipolar models discussed above are also applicable to HBTs, 
with appropriately chosen parameters. 

An important deviation concerns self-heating effects. In homojunction bipolar tran- 
sistors, both the collector saturation current and the current gain have positive tem- 
perature coefficients. The saturation current (see Equation (2.115)) increases because 
the intrinsic carrier concentration in the base depends exponentially on temperature. 
The current gain is limited by bandgap narrowing in the heavily doped emitter (see e.g. 
Equation (2.162)). The bandgap narrowing effect has a negative temperature coefficient, 
which lets the current gain increase with increasing temperature. 

In a wide-gap emitter HBT, the saturation current equally has a positive tempera- 
ture coefficient. The current gain, however, decreases with increasing temperature. To 
understand this, investigate again Equation (2.163): 


Bmax ~ eaEG/kT 


The enhancement factor therefore decreases with increasing temperature, because the 
valence band barrier gets less and less efficient. 

A frequently observed self-thermal effect in HBTs is the current crush in multi-finger 
HBTs (see Figure 2.91). At moderate dissipated powers, all fingers will have approxi- 
mately the same temperature and the current distribution is equal. With increasing Vcg 
and under constant base current, the collector current will gradually decrease due to the 
negative temperature coefficient of B. However, due to the strongly positive tempera- 
ture coefficient of the saturation current, a finger which is slightly hotter than the others 
will draw more and more current, deviating it away from the others, and increase its 
temperature. This strongly non-linear positive feedback will lead to a situation where 
the hottest finger takes on all the current, increases dramatically in temperature, with 
a resulting sudden decrease in current gain. This effect cannot be modelled with the 
standard bipolar models. 

Another important effect which cannot be simulated using the standard models is the 
rapid onset of high-current effects in DHBTs, discussed in the context of Figure 2.78. 

A minor effect, but worth mentioning, is that the non-ideal base current, see Equa- 
tion (2.122), may have a different Vgz dependence in HBTs. This is due to space charge 
region recombination associated with the conduction band discontinuity at an abrupt 
emitter—base heterojunction (see Figure 2.74). The potential well on the base side of 
the junction leads to an increased recombination, which depends in turn on the voltage 
across the junction. 
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Current crush phenomenon in a multi-finger HBT (http://parts.jpl.nasa.gov/mmic/mmic_ 
complete.pdf). Courtesy NASA/JPL-Caltech. 


Problems 


(1) Ina MESFET device, the following parameters are known from the fabrication 


(2) 


process: 

Gate Channel Drain/source 

Ti, Og = 0.7evV, Np =1-10'7em77, Rs = Rp = 102 
Lg = 1pm, layer thickness 

Woe = 100m, a=0.3um, 

Rg =12 Vsat = 1.2: 107 cms“! 


a) Draw the qualitative band diagram under the gate in thermodynamic equilib- 
rium. 

b) Calculate the Schottky gate built-in voltage and the pinch-off voltage. 

c) Assuming constant velocity throughout the channel, calculate the drain current 
for an applied bias of Vos — Vp = 2 V and Vps = 3 V. 

d) Calculate the device transconductance. 

e) What is the expected transit frequency? 

f) Calculate the expected minimum noise figure Fimin at a frequency of 2 GHz. 


A GaAs MESFET with a gate width of WG = 100m is specified with 
a transit frequency fr = 35GHz and a maximum frequency of oscillation 


Jmax = 50 GHz. The transconductance is gm = 21 mS, and the gate resistance is 
Rg = 22. 
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(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


Estimate the gate-source capacitance and the gate-drain capacitance, assuming 
that the output conductance is negligible. Can you provide an estimated value for 
the minimum noise figure? 

Due to a fabrication error, the gate resistance is increased to 5 Q. What is the 
impact on fT, fmax and Finin? 


Why does a MOSFET require an overlap between the gate and the source 
implantation region? What does this imply for device capacitances? 


In an n-channel MOSFET with a metal gate electrode, the original gate metal is 
replaced by a metal with a smaller work function. Explain qualitatively the effect 
on the threshold voltage. 


In a MOSFET technology, the thickness of the field oxide is chosen such that 
under the highest possible voltage between metallisation and substrate, no inver- 
sion channel can form at the SiO2/Si interface. Considering an Al metallisation 
with a work function of 4.1 eV and a bulk doping concentration of Na = 5-10!7, 
calculate the minimum thickness of the field oxide, if the maximum voltage 
between Al metallisation and the substrate is 5 V. 


A silicon-on-insulator n-channel MOSFET has a p-doped ‘bulk’ layer above the 
oxide layer with a doping concentration of Na = 2 - 10!©cm7?. The gate is 
heavily n-doped poly-Si. The gate oxide thickness is 5 nm. 

Calculate the thickness of the doped layer such that it will be fully depleted in 
active device operation. What is the purpose of the buried oxide layer? Explain 
its effect(s) on device performance. 


In a HEMT, what is the purpose of the spacer layer? Would the device still 
function without it? 


A HEMT device has the following layer structure: 
Function Composition Thickness Doping concentration 


Supply Alo,3Gao,7As 80nm Np =3-10!7?cm-3 
Spacer Alo.3Gag,7As Snm Nominally undoped 
Buffer GaAs 100nm Na =1-10'5 cm —!3 
Substrate GaAs 150m __ Intrinsic 


Calculate the threshold voltage of this device at room temperature. 
Let now Ves — Vorr = 0.5 V. Calculate the sheet density charge of the 2DEG. 


You want to optimise the gain and low-noise behaviour of a HEMT by changing 
the position of the gate electrode between source and drain. You observe the best 
performance if the gate is placed 

a) in the middle between the contacts 

b) closer to the drain contact 

c) closer to the source contact 

One or none of the statements is true — explain your choice! 


Draw the small-signal equivalent circuit of an FET for Vps = 0, making appro- 
priate simplifications. What is the relationship between Ccs and Cop in this bias 
point? Would you expect a noticeable difference between MESFETs and HEMTs 
in this mode of operation? 
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(11) Explain the two major advantages of a pseudomorphic HEMT structure, com- 
pared to the classic AlGaAs/GaAs HEMT. How do they relate to ft, fmax and 
Fin? Is there a potential disadvantage of the lower band gap in the channel? 

(12) In order to reduce the series resistance of the gate, FETs (MESFETs, HEMTs and 
MOSFETs alike) are typically constructed with several gate fingers in parallel. 
Which effect(s) on device performance will result from this measure? Will this 
affect the transit frequency fr? Explain your answer. 

(13) A HEMT technology has fp = 80GHz and gn = 600mS mm7!. For a device 
with Wc = 2- 60m, the gate resistance is measured to be Rg = 10 Q, and the 
minimum noise figure at 24GHz is Fnin = 1.7dB. For a device with WG = 
6 - 601m, what is the expected noise figure at 30GHz? You may neglect the 
source and drain resistances. 

(14) Only one of the following answers is correct: in a bipolar transistor in active 
forward operation, the base transit time is 
a) not a function of Vcg 
b) a weak function of Vcg 
c) astrong function of Vcr. 

Explain your choice! 

(15) Explain the following observations on high-speed bipolar transistors: 

a) In devices optimised for record fy, the maximum frequency of oscillation is 
often quite low, and the breakdown voltage is also low. 

b) Devices optimised for record fmax often have low fr, higher breakdown 
voltage, and need relatively high Vcg for optimum operation. 

c) The latter devices suffer from significant Kirk effect. 

(16) In n-p-n bipolar transistors, a drift field for electrons in the base can reduce the 

base transit time. This can be done in two ways: 

(a) introduce a continuously varying material composition; 

(b) vary the doping concentration in the base. 

Explain how to achieve a constant field strength in the neutral base using either 
of the two methods. 

(17) Note that Si/SiGe HBTs are always double-heterostructure devices. Why? 
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3.2 


Optimisation and parameter 
extraction of circuit models 


Executive summary 


Optimised device models are important in the design of electronic devices for specific 
performance. They help the designer to predict the behaviour of the device in an ana- 
logue circuit. However, standard methods of optimisation do not lend themselves to 
fast computation and may present problems with convergence. The simulated anneal- 
ing, genetic and structured genetic algorithms are alternative optimisation methods that 
help to solve the convergence problems by avoiding entrapment in local minima of the 
solution space. These methods are used for the extraction of the parameters of the equiv- 
alent circuit model of the device or to construct its neural network model. The neural 
network models are black box models that determine device outputs from known inputs 
after appropriate weighting values have been calculated. Another method of parame- 
ter extraction is the semi-analytical procedure that uses an analytic approach together 
with empirical optimisation methods. Basic expressions and approximations to extract 
small-signal equivalent circuit parameters are developed so that accurate device models 
can be obtained. 


Optimisation of device models 


Accurate device models are developed to predict behaviour that is in good agreement 
with experimental observations. The optimisation of the model parameters can be con- 
sidered to be the curve-fitting of the computed device characteristics to experimental 
data. Traditional gradient methods are computationally intensive and there are prob- 
lems with convergence and entrapment in local minima. Optimisation only involves the 
“trial” of a number of initial solutions to minimise the error. 

Combinatorial optimisation overcomes this problem of entrapment in local minima. 
To set up the device modelling problem as a combinatorial optimisation problem, the 
limits on each parameter are specified. Parameter values are then discretised so that a 
large but finite number of solutions is possible. An objective function, also referred to 
as the cost function is given by 


F(Vi) = (Mj — M)”, 3.1) 
j=l 
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where Vj (@ = 1, 2,...,”;m=number of parameters) = the model parameters to be 
determined; M; = a measured characteristic for the jth data point; M. ’ = a calculated 
characteristic which is a function of the parameters Vj for the jth data point; and m = the 
total number of characteristics to be fitted. Note that F(V) is a least square difference 
function. The number of possible solutions is extremely large and an exhaustive search 
for the optimum solution is practically impossible. Hence, it is necessary to employ a 
heuristic method. Such methods are well-established in CAD tools since they enable the 
designer to find a feasible solution in a finite period of time. A typical heuristic optimi- 
sation process utilises an iterative improvement strategy. At each step of the iteration, 
the algorithm generates a new solution and tests if it reduces the value of the objective 
function. If so, it accepts the new solution. If not, another new solution is generated and 
tested and eventually the global minimum of FV) in Equation (3.1) should be reached. 
However, the size of the solution space, as defined by all possible and physically mean- 
ingful parameters, is very large and grows exponentially with the number of variables 
in the model. An exhaustive search for the best solution cannot be performed in a finite 
period of time. 


Simulated annealing 


Most heuristic algorithms search for a solution only in the directions that improve the 
objective function. This type of heuristic search has a major drawback: it can be easily 
trapped into the local minima of an objective function. Figure 3.1 demonstrates the 
problem. 

The curve shown in Figure 3.1 is assumed to be the objective function of an itera- 
tive improvement process; the circles indicate the costs, i.e. F(Vj) in the least squares 
objective function given in Equation (3.1) calculated from certain parameter sets. Since 


Rejected 


Accepted 


\ 


Local minimum 


Least square difference 


Global minimum 


Different parameter sets 


Local minimum trapping in iterative optimisation. 
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a new set of parameters is generated by introducing small modifications to the model, 
its corresponding location on the curve is most likely to be somewhere near that of 
the original configuration. The traditional iterative improvement algorithms only accept 
parameter sets that have reduced the cost. This criterion of set acceptance implies 
that the process can only go downhill into a minimum and any uphill movement is 
forbidden. Depending on the starting point of the search, it is possible that this mini- 
mum is only a local minimum of the objective function. This search process generally 
does not have the capability of climbing over a peak of the curve to reach the global 
minimum. 

The drawback of this process can be overcome by using simulated annealing 
(SA)[12, 22, 23, 30]. This is a method of finding a near optimal solution for combi- 
natorial optimisation problems. The SA algorithm has the advantage of asymptotically 
producing the global optimal solution with a probability of one. This is achieved by 
making the following important modification to conventional heuristic methods. A cost- 
increasing solution may still be accepted. The probability of acceptance depends on: 
(1) a parameter called a pseudo-temperature Tx, which is artificially decreased as the 
iteration proceeds, and (2) the value of 


AF(V) = F(Vk) — F(V-i) (3.2) 


where Vx and Vx_; are the values of the parameter vector V at iteration steps k and 
k — 1. Specifically, a law similar to Boltzmann statistics is used to determine the proba- 
bility P of accepting a certain cost-increasing solution V_ at the kth iteration step. The 
probability function P is given by 


(3.3) 


P(Vx, Tk) = exp j= . 


Tk 


A careful choice of the initial pseudo-temperature 7p and a rule for decreasing the 
pseudo-temperature T are necessary to save computation time while being able to 
escape from the local minima. The temperatures are related by the equation 


Tk = &Tk-1, (3.4) 


where a is a constant between 0.8 and 0.95. Its value can be gradually increased 
from the lowest to the highest value. 7) > 500 is a satisfactory choice for device 
modelling. 

At each iteration, new parameter values are generated by first choosing one device 
parameter V; at random. A user-defined base value Vj—pbase is multiplied by a random 
number R, such that 0 < R < 1, and a variation AV; = RVj_pbase is introduced into the 
parameter V;. The new parameter value so obtained is used in the next iteration unless 
it exceeds prescribed limits, in which case it will be set to the maximum or minimum 
allowable value. 
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Hill-climbing capability of SA. 


A relative stopping criterion is used, since there is no guarantee that the device 
model can approximate experimental data closely. The optimisation process is stopped 
when the value of the objective function has remained virtually unchanged for several 
consecutive iterations (e.g. AF < 0.001 for 10 consecutive iterations). 

Figure 3.2 shows the same objective function as in Figure 3.1 with the hill-climbing 
capability of SA. 

Example: Application of SA to the modelling of a HEMT 
Optimisation by SA has been applied to the HEMT [30]. Three test cases are given as 
examples to demonstrate the optimisation process and evaluate its performance. Param- 
eters are allowed to vary within +90% of their initial values in test cases A and B. In 
test case C, the limits are reduced to +20% for realistic parameters. 

Test case A: AC Model of a HEMT 
The unilateral power gain U of the HEMT (also known as the MODFET as explained 
in Chapter 2) was determined by Roblin ef al. [21]. The values of U are sampled at 
various frequencies and used as the measured characteristic Mj to be matched to the 
model. The Mason unilateral power gain (defined in Chapter 5) is expressed in terms of 
the admittance (Y) parameters as 


Yor — Yi2) |? 
U= | (Yai — Yio) | 


= : 3.5 
4[Re(Y);)Re(¥22) — Re(Y12)Re(Y21)] a 


The Y-parameters are determined by primitive model parameters, i.e. the gate capaci- 
tance Co, the gate length L, the gate width Z, the bias voltage V = Vos — Vy and the 
parameter k which is given by 


Vos 


K=>————_, 3.6 
(Ves — Vr) " 


More free ebooks : http://fast-file.blogspot.com 


Fig. 3.3 


Optimisation and parameter extraction of circuit models 167 


100 Model Initial | Optimized 
Parameters Values Values 
V (Vv) 0.4 0.4 
80 - k 0.6 0.713 
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a Z (um) 100 106 
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Test case A: unilateral power gain curves and optimisation results [+, measured; solid line, 
optimised model; dotted line, initial solution] (M-K. Vai, S. Prasad, N. C. Li and F. Kai, JEEE 
Transactions on Electron Devices, Vol. 36, No. 4, pp. 761-762, April 1989. ©1989 

IEEE). 


where Vps is the drain to source voltage, Vgg is the gate to source voltage and Vy is the 
threshold voltage. 

Figure 3.3 compares the measured gain curve with curves generated from an initial 
model and the final optimised models. The objective function is reduced from an initial 
value of 4421.83 to 0.161095, virtually a perfect match. 

Test case B: Equivalent circuit of a HEMT 
The small-signal equivalent circuit of an intrinsic HEMT is used to deduce the unilateral 
power gain using Equation (3.5). The Y-parameters are expressed in terms of the circuit 
elements Rj, Cas, Rps, Cos, 2m and t. The objective function is reduced from 1783.78 
to 0.01789 when optimisation is completed. Figure 3.4 shows the agreement between 
measured and computed values using the optimised model. 

Test case C: Device design parameters 
In this test, a set of primitive device parameters (gate length, gate width, bias voltage 
and mobility) are obtained by optimisation such that the device has the highest value 
of fmax, the maximum frequency of oscillation. The analytical expression for fmax has 
been obtained by Roblin et al. [21] by setting the unilateral gain equal to 1. The primitive 
device parameters are related to fmax and the objective function is formulated. The 
initial design gives a value of fmax equal to 29.57 GHz. The optimised HEMT has fmax 
equal to 165.81 GHz. Such an optimisation is an aid to the device designer. Table 3.1 
gives the optimised values. The initial value of K is 0.6 and the optimised value is 0.95. 
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Table 3.1 Test case C: optimised design parameters for the highest fmax 


Model parameters 


Initial values 


Optimised values 


Vr(V) 0.4 0.4 
L(wm) 1 0.8 
Z(um) 250 300 
Mobility p(cm) ff (V-s) 4000 4800 
Co(pF) 0.1 0.108 
120 Model Initial | Optimized 
Parameters Values Values 
Rj (ohm) 7 8.02 
| C&s (pF) 0.6 0.06 
we Rep(ohm) | 0 0 
a Cep (pF) 0.06 0.114 
" Rpg (ohm) 120 138 
= g04 ps (PF). | 0.2 0.38 
= _ Im (MA/V) 56 49.53 
a by, . 3.61 
: % 
6 60- 
rom 
g 
£ 
& 40- 
‘c 
= 
20 - 
0 Lut | L Lol il Lotti 
001 .01 1 1 10 100 1000 


Frequency (GHz) 


Fig. 3.4 Test case B: unilateral power gain curves and optimisation results [+, measured; solid line, 
optimised model; dotted line, initial solution] (M-K. Vai, S. Prasad, N. C. Li and F. Kai, JEEE 
Transactions on Electron Devices, Vol. 36, No. 4, pp. 761-762, April 1989 © IEEE). 

3.4 Neural networks applied to modelling 


The SA algorithm has been applied to device modelling in the previous section. SA 
avoids the local minimum entrapment problem and has been shown to be preferable to 
other optimisation methods since it is relatively insensitive to initial conditions. How- 
ever, it inherits the time-consuming feature of iterative improvement methods. As was 
shown, the probabilistic hill-climbing operation increases the time taken for the comple- 
tion of the optimisation. Furthermore, the solution has to slowly “cool down” according 
to the annealing procedure. Consequently, a large number of intermediate solutions have 
to be generated and evaluated at each pseudo-temperature which has to be decreased 
slowly from a large initial value. This disadvantage of the SA algorithm for optimisation 
can be overcome by using an artificial neural network (ANN). 
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ANNs are based on the human nervous system which consists of a distribution of neu- 
rons to carry messages back and forth to the brain. The ANN has been used successfully 
in many modelling and optimisation applications in engineering that are particularly 
useful when several tasks are to be performed in parallel and computation rates are 
required to be high [31, 37]. Neural networks are particularly attractive because of their 
speed and accuracy. Hence, neural networks have been developed into an alternative 
computer-aided approach to model and design devices and circuits. Neural networks 
represent a robust modelling approach to predict the behaviour of high-speed devices 
and circuits [29]. In comparison with various statistical methods and curve-fitting 
approaches for predicting system behaviour, the neural network approach features a 
learning process which fine tunes neural network parameters to interrelate the variables 
being modelled. A neural network may be developed to guide the solution generation of 
an SA optimisation process. This approach utilises the associative capability of a neural 
network to globally and concurrently evaluate the effect of varying all the parameters. 
When used in place of a physics-oriented device model, a neural network avoids the 
need to repeatedly solve the equations that describe the device physics. 

Two classes of neural networks are used in modelling: 


(i) Multi-layer perceptron neural networks 
(ii) Recurrent Hopfield neural networks. 


Regardless of the neural network architecture selected for an application, it consists 
of many processing elements called neurons, each connected to many others. Every 
connection entering a neuron has a weight assigned to it. This weight is used to amplify, 
attenuate or change the sign of the signal in the incoming connection. The input to 
the neural network is a vector of the data to be modelled. Each neuron operates on 
the outputs of other neurons according to its transfer function and delivers an output 
to other neurons. Often, the transfer function sums the incoming signals to determine 
the value of the neuron’s next output signal. The result is an output vector representing 
some characteristics associated with the input. 

In order to use neural network algorithms, it is necessary to determine an intercon- 
nection pattern, the weights and the transfer functions. The creation and training of an 
appropriate neural network for the problem on hand is difficult and time-consuming. 
However, an appropriately trained neural network provides fast and efficient solutions 
that have shown excellent results for different applications such as the modelling of 
transistor behaviour and microwave circuits as well as microwave impedance match- 
ing [28]. A neural network consists of a set of simultaneous non-linear equations that 
are capable of modelling any continuous function when the appropriate weights are 
determined. These networks are pictorially represented as neurons (circles) with inter- 
connecting nerves (lines). Neural networks are very flexible tools for device modelling 
because they can be adapted to model different devices without change of equations. 
Adapting a physics-based model to a different physical device may involve a radical 
change of equations. This flexibility gave the impetus to the effort of finding much 
better ways of applying neural networks to device modelling. However, the problem 
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of finding the appropriate weights for the network is a highly non-linear one, sug- 
gesting the necessity to use stochastic optimising algorithms such as SA and genetic 
algorithms. 


Massively distributed computing networks 


A general description of the distributed computing methodology is given by Vai and 
Prasad [31]. Massively distributed computing networks are a specific form of a non- 
linear system that maps an input to an output. A distributed computing network can be 
considered to be an asynchronous array processor with very simple processing elements 
(i.e. neurons). Figure 3.5 shows a typical processing element, henceforth referred to as 
a neuron, with n inputs (i;,...,i,) and one output (Q). An input can be excitatory 
(indicated by a solid circle) or inhibitory (indicated by a hollow circle) and is assigned 
a weighting factor Wj. A threshold value T is associated with the neuron. 

The function of a neuron can be described by the following equation which combines 


inputs i7;,..., én to form an overall input value /: 
n 
=) W xi, (3.7) 
— 


where Wj is positive for an excitatory input and negative for an inhibitory input. If the 
overall input value J is above the threshold value T associated with the neuron, the 
neuron fires and an output of Q = | is produced. Otherwise, the output remains at 
Q = 0. A neuron is also associated with a time constant (tT) that determines its output 
response time. 

While the operation of a conventional computer is controlled by a series of instruc- 
tions, called a programme, a massively distributed computing network is programmed 
by wiring up a set of neurons and setting the weights of these interconnections. The 
function of a distributed computing network can only be determined by considering 
the network as an integrated entity. No meaningful information can be extracted by 
examining a neuron isolated from its neighbours. 

A distributed computing network is typically implemented by a hardware analogue 
circuit. Figure 3.6 shows the use of an operational amplifier configured as an integrat- 
ing adder to carry out the function of a neuron. As shown in Figure 3.6, the input 


Neuron structure. 
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From other neurons 
or external inputs 


W,| W, 


To excitatory inputs 
> 


> 
= To inhibitory inputs 


Physical implementation of a neuron (M. Vai and S. Prasad, JEEE Transactions on Microwave 
Theory and Techniques, Vol. 43, No. 5, pp. 1087-1094, May 1995. ©1995 IEEE). 


Q (output) 


T | (overall input) 


Neuron Transfer Function (M. Vai and S. Prasad, IEEE Transactions on Microwave Theory and 
Techniques, Vol. 43, No. 5, pp. 1087-1094, May 1995. ©1995 IEEE). 


weightings of such a neuron can be controlled by choosing appropriate resistance values 
connecting its inputs to the outputs of other neurons. The time constant (7) of this neu- 
ron is determined by the capacitance connected at the operational amplifier input. The 
neuron transfer function shown in Figure 3.7 shows the input-output transfer function 
of a neuron. 


Multi-layer perceptron neural networks 


The training algorithm called backpropagation [16] is used in the application of mul- 
tilayer perceptron (MLP) neural networks to device modelling. A multi-layer neural 
network with four layers (one input layer, two hidden layers and one output layer) is 


shown in Figure 3.8. Referring to the notations in Figure 3.8, X = (x1, ...,Xi,..-, Xm) 
is the input vector, G = (g1,...,8),---,8n), H = (h1,...,hk,...,hp) and Y= 
(Y1,--+, YI,-+++, Yq) are the outputs of the first hidden layer, the second hidden layer 


and the output layer, respectively; uj; is the weight between the ith neuron and the jth 
neuron in the first hidden layer; vjx is the weight between the jth neuron in the first hid- 
den layer and the kth neuron in the second hidden layer; and wx is the weight between 
the kth neuron in the second hidden layer and the /th neuron in the output layer. Bias 
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Multi-layer neural network (M. Vai and S Prasad, Int’! Journal of RF and Microwave CAE, Vol. 
9, No. 3, pp. 187-197, March 1999. © 1999 John Wiley & Sons). Reprinted with permission of 
John Wiley & Sons, Inc. 


terms acting like weights on connections from units whose output is always | can also 
be provided to the neuron. They are not shown in Figure 3.8. 
The output of the neural network is computed as 


1 


LES T+e-v’ (3.8) 
where y¢ is the weighted total input to the output neuron £, which is defined as 
Pp 
ve = Y_ hwee, (3.9) 
k=1 


and p is the number of neurons in the second hidden layer. Similarly, the output of 
the second hidden layer H can be expressed as a function of the output of the first 
hidden layer G which can in turn be expressed as a function of the input vector X. The 
backpropagation training algorithm aims to adjust the weights of a MLP neural network 
in order to minimise the sum-squared error of the network, which is defined as 


S 


1 q 
E(n) = 075d [ami - sai | (3.10) 


m=1 l=1 
where n is the epoch number in the training process, S$ is the number of training 
data, g is the number of output variables and dm = (dmidm2,..-,dmq) and ym = 
(Ym1Ym2;---, Ymq) are the mth desired and calculated output vectors, respectively. This 
is done by continually changing the values of the weights in the direction of steepest 
descent with respect to the error function E. The iteration process continues until the 
error function is minimised. The learning is performed by the many presentations of 
a prescribed set of training examples to the network. One complete presentation of the 
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entire training set during the learning process is called an epoch. The learning process 
continues on an epoch-by-epoch basis until the weights of the network stabilise and the 
error function converges to a minimum value. 

There are certain problems related to the architecture of an MLP neural network 
such as the determination of the number of hidden layers and the number of neurons 
in a hidden layer as well as under-fitting or over-fitting. The backpropagation learning 
algorithm and its derivatives are sensitive to the number of neurons in hidden layers. In 
general, a network with too few neurons will fail to model the data (i.e. under-fitting). 
While the more the number of neurons in hidden layers, the better the network can fit 
the data; if far too many neurons are used, over-fitting can occur. In the absence of a 
deterministic approach that can find the number of hidden layers and the number of 
neurons, a trial and error approach is taken. The hidden layers are adjusted to strike a 
balance between memorisation and generalisation. 

A neural network trained with the relations between device parameters and behaviour 
can be used in place of conventional device models to speed up the simulation. Once a 
neural network model is trained, it provides a very fast prediction of results. Figure 3.9 
shows a simplified flow chart of the circuit design process. Beginning with an initial 
solution, a series of solutions is generated. The circuit property of each solution is 


Initial 


Solution v | 
Solution 
Generation 
Circuit Model 
Desired 
property 
Evaluation 
v 


Final Solution 


Flow chart for a circuit design process (M. Vai and S. Prasad, Int’l Journal of RF and Microwave 
CAE, Vol. 9, No. 3, pp. 187-197, March 1999. ©1999 John Wiley & Sons). Reprinted with 
permission of John Wiley & Sons, Inc. 
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predicted by a circuit model and compared to the desired circuit property. If the solu- 
tion on hand produces a circuit property close enough to the desired one, the design 
process is successful and terminated. Otherwise, another solution is generated and the 
above steps are repeated. The circuit model in Figure 3.9 often includes semiconductor 
devices which are commonly represented by the equations describing the physics of the 
particular device or equivalent circuit. 


Hopfield recurrent neural networks 


Although neural networks are known for their capability of learning the solutions to the 
problems that they are designed to solve, they also provide a framework for construct- 
ing special computing architectures to solve specific problems. The recurrent neural 
networks described here were proposed by Hopfield and are thus often referred to 
as Hopfield networks [11]. Consider a recurrent neural network of N neurons. If the 
activation of a neuron is updated according to the equation: 


N 
Viet +1) =sgn[ DOTY) + 4], (3.11) 
j=l 


where Vj(t) € (0, 1) is the state of neuron i at moment f, 7j; is the weight associated 
with the link between neurons i and j, J; is the internal threshold parameter of neuron i 
and sgn(x) is defined as 


sgn(x) = : (3.12) 
0, x <0 


It can be shown that an energy function defined as 


N 


N N 

1 

Bnergy = —5 D7) 1 TMi — DUA +K, (3.13) 
i=) j=) i=} 


where K is a constant, is minimised. 

The significance of a recurrent neural network is its ability to perform associative 
inference. There is no specific distinction between input and output vectors and a recur- 
rent network perturbed by changing one or more neuron states will evolve into one of 
its consistent states which are the minima of its energy (Equation 3.13). The recurrent 
neural network is now applied to implement qualitative models, the function of which is 
to explore many competing hypotheses in a solution space with constraints. The Neural 
Network toolbox in MATLAB is available to allow fast and efficient device modelling. 

Example: Modelling of HBTs using neural networks 
Small-signal models of microwave devices are very useful in circuit design. Neural 
networks can be trained to learn the non-linear relationship between the small-signal 
transistor behaviour and the device input bias conditions. Figure 3.10 shows a represen- 
tation of a neural network small-signal model of an HBT. The inputs of this model are 
the bias conditions of the device, i.e. the base current J), the collector—emitter voltage 
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Input Hidden Output 
layer layer layer 


Small-signal neural network model of the HBT. 


Vee and the frequency of operation f. The outputs of the model are the S parameters of 
the device. 

As shown in Figure 3.10, the small-signal neural network model is a three-layer 
model — there are 3 layers in the input layer, 12 neurons in the hidden layer and 8 neu- 
rons in the output layer. The 3—12-8 structure of the neural network model is arrived 
at after various trials as being the best compromise between model accuracy and the 
time required to train the model. The model is trained with a sample set of measured 
S-parameter data. Different input bias conditions are then applied to the model and S- 
parameters derived from the model are compared to measured data. The data used to 
train the neural network model consist of measured S-parameters at 12 different bias 
conditions (Vpe and J,), and a range of frequencies from 10 GHz to 40 GHz. The neural 
network modelling is tested for AlGaAs HBTs as well as SiGe HBTs. The models are 
accurate for both the material systems. The method should be applicable for all types 
of transistors regardless of the material system. The S-parameters for the devices are 
shown in Figures 3.11 and 3.12 for AlGaAs with bias Vee = 1.5 V, Jy = 56\A and in 
Figures 3.13 and 3.14 for SiGe with bias Vee = 1.9 V, Ip = 53 A. 

Figure 3.15 shows a representation of a neural network large-signal model of the 
device. The inputs of this model are the voltage bias conditions of the device: the 
collector—emitter voltage Vce and the base-emitter voltage Vie. The outputs of the model 
are the two output currents of the device: the collector current J, and the base current Jp. 

As seen in Figure 3.15, the large-signal neural network model consists of three layers: 
an input layer consisting of two neurons, a hidden layer consisting of three neurons and 
an output layer of two neurons. The 2—3-2 structure of the neural network model was 
arrived at after various trials. It is the best compromise between model accuracy and the 
time required to train the model. The model is trained with a sample set of measured 
DC I-V characteristics and Gummel data. Different input voltage bias conditions are 
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Fig. 3.14 The simulated and measured $7 and $21 for SiGe HBT. 
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Fig. 3.15 Large-signal neural network model of the HBT. 
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Fig. 3.16 I-V Characteristics of the AlGaAs HBT. 
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Fig. 3.17 I-V Characteristics of the SiGe HBT. 
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Fig. 3.18 Forward Gummel plot Jc versus Vpe for AlGaAs HBT. 


then applied to the model and output currents derived from the model are compared to 
measured data. The data used to train the neural network model consist of measured 
output currents (J, and Jy) at different voltage bias conditions (Vbe and Vee). The DC 
I-V characteristics (measured and simulated) of the devices are shown in Figures 3.16 
and 3.17. These figures show the collector current J, versus the collector—emitter voltage 
Vee at different base currents Ip. 
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Fig. 3.19 Forward Gummel plot J, versus Vpe for AlGaAs HBT. 
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Fig. 3.20 Forward Gummel plot /¢ (top curve) and J, (bottom curve) versus Vpe for SiGe HBT. 


Figures 3.18, 3.19 and 3.20 show the forward Gummel plot of J, and Jp versus Vpe. 
The success of the application of neural network modelling in the example is evident 
from the very good correlation between the measured and simulated data. 
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Optimisation of neural networks by the genetic algorithm 


Like neural networks, genetic algorithms are an optimisation strategy inspired by nature. 
Based on the Darwinian theory of evolution, these algorithms use the “survival of the 
fittest’ paradigm to find the best solution to a problem [8, 10, 32]. They iteratively 
evaluate several possible solutions choosing the ones that are the closest fit to the desired 
solution. The possible solutions are called chromosomes and are usually represented as 
strings of binary numbers called genes. 

The algorithm begins by randomly generating a number of chromosomes to form a 
population. Each chromosome is given a rank called a fitness index based on its close- 
ness to the desired solutions. The highest ranked chromosomes have a greater chance of 
being selected for the next stage of the algorithm: the reproduction stage. In this stage, 
pairs of chromosomes are separated at selected points and their genes are exchanged 
in a process called crossover to form a new generation of chromosomes. Because the 
most fit chromosomes are likely to be selected for reproduction in every generation, 
each new population is likely to consist of better solutions. Coupled with the fact that 
the reproduction process naturally eliminates less fit chromosomes from the population, 
the population gets pushed towards the desired solution in every new generation. When 
trying to reach the optimal solution for the problem, a genetic algorithm has to avoid 
local extrema or pseudo-optimal solutions. To prevent convergence of the algorithm 
to these local extrema, the genetic algorithm uses a technique called mutation which 
inverts the value of a randomly chosen gene with a given probability. A flow chart for 
the genetic algorithm is given in Figure 3.21. 

The genetic algorithm is a robust optimisation method well-suited to the difficult task 
of finding the optimum weights for a neural network. However, before it can be used to 
evolve the weights of a neural network, the following prerequisites must be worked out: 


(i) A suitable problem representation 

A chromosome of a neural network corresponds to the weight matrix for a 
layer. Substrings of the chromosome are made by concatenating the values of the 
real-valued weights between each neuron and the neurons in the previous layer. 
The substrings are then joined together into the chromosome for the layer. In 
some implementations, the real-valued weights are converted to binary values for 
enhanced gene exchange during crossover. However, it is sufficient to use strings 
of real-valued weights in most cases for genetic evolution. 

(ii) A fitness index 
A fitness index measures the closeness of the current output of the neural network 
to the desired output. It is the least squares norm of the difference. 

(iii) The reproduction strategy 
Reproduction involves two processes: crossover and mutation. During crossover, a 
pair of chromosomes are separated at a randomly chosen point along their length. 
The resulting substrings of the pair are switched and then joined to form child 
chromosomes. This is repeated for every pair of chromosomes in the population 


More free ebooks : http://fast-file.blogspot.com 


Fig. 3.21 


3.6 


Optimisation and parameter extraction of circuit models 181 


Create initial 
population 


| 


Evaluate fitness of 
initial population 


Desired fitness 


reached Ene 


Select individuals 


Crossover 
to produce offspring 


Mutate offspring to 
create new generation 


Flow chart for the application of the genetic algorithm. 


(iv) 


although some implementations will allow a small percentage of the fittest indivi- 
duals to pass to the next generation without reproduction. Mutation is applied to a 
small percentage of the population. In this operation, the value of a gene is changed 
by adding a small value to it. 

The termination criterion 

Although, it would seem desirable to train the neural network till the error is as 
small as possible, this may be neither practical nor ideal. It may take up to several 
days to train a neural network. In addition, an excessively trained network can fail 
to produce correct outputs when presented with inputs not included in the training 
set. This is called over-fitting. The appropriate termination criterion for a given 
application is best determined from experiments. A common way is to terminate 
the algorithm after a fixed number of iterations. 


Structured genetic algorithm 


Neural networks are usually designed by determining the optimum values of weights 
for a fixed number of neurons. However, it is impossible to know in advance how 
many neurons are optimal for an application. This difficulty can be avoided by using 
the structured genetic algorithm (SGA), an algorithm that enables the determination 
of the optimum number of neurons and weight values simultaneously [4]. SGA uses a 


More free ebooks : http://fast-file.blogspot.com 


182 


High-Speed Electronics and Optoelectronics 


hierarchical representation for the genetic structure in which neurons are a high level 
layer of binary-valued genes controlling a lower level layer of weight genes. When a 
high level gene is on (value = 1), the lower level weights genes it controls are activated 
and used in the computation of the neural network’s output. When a high level gene is 
off, the weights it controls are deactivated. Therefore, SGA is able to evaluate a variable 
number of neurons in the neural network. Neurons are turned on or off by the processes 
of mutation or crossover. The equations for SGA applied to a neural network with one 
hidden layer can be written as follows. Let X be a set of inputs to the networks and 
U the set of inner layer weights, then the output of a neuron in the middle layer is 
given by 


y= 6f (x uixi + ) (3.14) 


i=l 


where f (x) is the sigmoid function: 


1 
= ——_ 3:15 
f= WR (3.15) 
and 6 is one when the neuron is active and zero otherwise. 
If the set of the output weights is taken to be V, then the network outputs are 
given [36] by 


a=) wy: (3.16) 


Genetic algorithms are relatively slow algorithms and are best used to determine the 
weights for just the hidden layers of a neural network. The determined inner weights 
are then used to calculate the activations of the hidden layer neurons and this reduces 
the problem to a linear equation, where linear regression can be used to calculate the 
output layer weights. 

The SGA for neural networks is given in Figure 3.22. 

Example: Application of SGA to the modelling of a HEMT amplifier 
The use of the SGA is illustrated by creating a neural network model of a HEMT class- 
F power amplifier as shown in Figure 3.23. IMN and OMN denote the input matching 
network and output matching network. A class-F amplifier is a highly efficient switching 
amplifier used in mobile commercial and military systems. The design and properties 
of amplifiers are described in Chapter 5. 

Table 3.2 shows the inputs and outputs for one hidden layer neural network model. 
The upper limit for the number of neurons in the hidden layer was arbitrarily set to 
40. Before training, the data were scaled with a linear transform to lie in the range of 
the sigmoid activation function (0.1—0.9) used for the neurons. At the start of training, 
the network’s weights were initialised to small random values. The input to the SGA 
is shown in Table 3.3. Using the crossover and mutation operations, the initial weights 
were optimised until the termination condition was satisfied. The transfer function of 
the trained neural network and that of the HEMT are plotted in Figure 3.24. 
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Fig. 3.23 Simplified power amplifier schematic. 


Table 3.2 Neural network model for HEMT power amplifier 


INPUTS OUTPUTS 
Input power Output power 
Gate voltage DC gate current 
DC drain current Drain current 
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Table 3.3 SGA parameters for HEMT power amplifier model 


Parameter Value 
Initial population size 200 
Number of generations 10000 
Number of input neurons 3 
Number of output neurons 1 
Maximum hidden neurons 40 
Number of samples 140 
Crossover probability 0.8% 
Neuron layer mutation rate 0.0001% 
Weights layer mutation rate 0.0001% 
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Comparison of measured transfer function to values predicted by the neural network for the 


HEMT amplifier. 


Semi-analytical device parameter extraction 


Accurate, physically meaningful device models are necessary for circuit design, pro- 
cess technology design and optimum device design. The extraction of equivalent circuit 
parameters has been investigated by researchers for more than a decade [1—3, 5-7, 20, 


25-27, 33-35]. 


More free ebooks : http://fast-file.blogspot.com 


3.7.1 


Fig. 3.25 


Optimisation and parameter extraction of circuit models 185 


A semi-analytical parameter extraction procedure for the HBT equivalent circuit 
developed by Li and Prasad [15] is presented here as one illustration of parameter 
extraction. It combines analytical and optimisation approaches. The significance of this 
procedure is that it is completely general and can be applied to any semiconductor 
device using the appropriate equivalent circuit. 


Theoretical analysis 


An AlGaAs common-emitter HBT is used to illustrate this procedure. The small- 
signal T-model equivalent circuit is shown in Figure 3.25. The box with dashed lines 
in Figure 3.25 encloses the inner shell without the pad parasitics. All the impedance 
parameters given below are for the inner shell. There are 16 elements in the equiv- 
alent circuit. Only Che, re, &, Rpc and Cpe are considered to be bias-dependent and 
all the other elements are assumed to be bias-independent. The expressions for the 
two-port Z-parameters of the inner shell can be simplified in terms of the frequency 
ranges. The frequency ranges are characterised by wCpcRpe « 1 (low-frequency 
range), WCpc Rpi, @CFRvoi K | and wCycRbe >> | (intermediate frequency range) and 
@CpcRpi > 1 (high-frequency range). In the measurements on high-speed devices, 


@c 


if Coce 


inner shell (z) 


E 


The small-signal equivalent circuit of the AlGaAs/GaAs HBT (B. Li, S. Prasad, L.W. Yang and 
S. C. Wang, IEEE Transactions on Microwave Theory and Techniques, Vol. 46, No. 10, pp. 
1427-1435, October 1998. ©1998 IEEE). 
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most of the frequency data are located in the intermediate frequency range. In the 
low-frequency range, 


Zit — Z12 = Rox + oly + Roi — jORve RoiCs (3.17) 
Zi2 = Re t+ jolLe+re — jor Coe 

+ (1 —a@) x joRpiCe(Roc — joR?.Cs) (3.18) 

Zn2 — Za = Ro + joe + Roc — JoRGCs, (3.19) 


Where Cy = Che + Cy. 
In the intermediate frequency range, the Z-parameters can be approximated as 


Cc 
Ziu—- Zi2= Ryx + joly + Roi (3.20) 
Ss 
Z12 = RpE+ joLle +re — jor Coe +(1-a) 
ee (3.21) 
x ix = i 
bi c. 


1 1 
Z22 — Z23 = R pol, 
22 21 ct Jo or Tae. Be, 


RoiCocCp —_, RoiC Ce 


; 3.22 
Ge : oC? Roc ( ) 
In the high-frequency range, the simplified relation is 
Z Z Reo + joL ; a ! 
— — (63) 
a CT IONE OR CocCt | O* ReiCoe Ct 
( —— ) 3.23) 
x : 

JOCER — J OCbc Roi 

1 —j@t 
a=ag ae uae (3.24) 

Lees 


Example: Extraction of equivalent circuit elements 

For the HBT used in this example, the intermediate frequency range is taken to be 
approximately from 0.5 GHz to 20 GHz, and the high-frequency range should go up to 
40 GHz. The condition for the high frequency is relaxed and frequencies over 25 GHz 
are considered to be in the high-frequency range. 

A. Extraction of the parasitic elements 

If no test structure is available for extracting the parasitics, it is still possible to extract 
or estimate the pad capacitances from the HBT under cut-off operation [7, 24]. Under 
cut-off operation, we have zero Vpg, zero J,, and variable Vcg. The HBT equivalent 
circuit of Figure 3.25 is reduced to the simplified circuit shown in Figure 3.26, provided 
the influence of the inductances and resistances remains negligible and the conditions 
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Cy 


To Cobe Cre 7 Croce 


The simplified HBT equivalent circuit under cutoff operations in which both junctions are 
reverse-biased and the influence of the inductances and resistances remains negligible (B. Li, S. 
Prasad, L.-W. Yang and S. C. Wang, IEEE Transactions on Microwave Theory and Techniques, 
Vol. 46, No. 10, pp. 1427-1435, October 1998. ©1998 IEEE). 


Roc > A/@Cpc) and Rpe > (1/@Che) are satisfied. The capacitances in the equivalent 
circuit can be calculated directly: 


Im(¥;) + Im(¥j2) 


Cpbe + Coe = en (3.25) 
Im(Yo2) + Im(Y, 
ne (Y22) (Y12) (3.26) 
Im(Y 
Cpe + Cr+ Cpe = — (3.27) 


In the above equations, the Cpbe, Cpbc. Cpce and Cp are considered to be bias- 
independent and Cpe and Cpe are bias-dependent elements. The value of Cpce can 
be calculated from Equation (3.26). Figure 3.27 clearly shows that Cpce is bias- 
independent. 

Che, the base-emitter junction capacitance, can be described by the equation 


2 Cjeo 
[1 + (Vee / Vip) ]"™ 


The extraction of Cppe can be carried out by fitting the sum Cpbe + Cre to Equa- 
tion (3.28) at different reverse base-emitter voltages or by using the iteration method 
in which different values of Vine, Mjbe and Cjpeo are tried until the plot of Cpbe + Coe 
versus [1 + (Vep/ Vibe) Mite is a straight line shown in Figure 3.28. 

Similarly, (C+ Cpbc) can be extracted by fitting the sum (Cf + Cpbe + Coc) to the 
expression for junction capacitance at different base—collector voltages. However, it 
must be noted that it is difficult to distinguish between the base—collector coupling 
capacitance and extrinsic base—collector capacitance [24]. This is due to the fact that 
the distance between the base probe tip and the collector probe tip in probe stations 


(3.28) 


used for most high-speed measurements is usually longer, and thus the coupling effect 
between base and collector contacts must be very small; furthermore, the influence of 
Cpbc can be absorbed by the extrinsic base—collector capacitance Cy. Thus, Cpbc can 
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Fig. 3.27 The calculated Cpce versus frequency (B. Li, S. Prasad, L.-W. Yang and S. C. Wang, IEEE 
Transactions on Microwave Theory and Techniques, Vol. 46, No. 10, pp. 1427-1435, October 
1998. ©1998 IEEE). 
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Fig. 3.28 The fitting of the sum Cpbe + Che to the expression for junction capacitance (B. Li, S. Prasad, 


L.-W. Yang and S. C. Wang, JEEE Transactions on Microwave Theory and Techniques, Vol. 46, 
No. 10, pp. 1427-1435, October 1998. ©1998 TEEE). 
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be chosen to be zero. Such an assumption is also confirmed by the empirical opti- 
misation procedures. The S-parameters measured over the frequency range of interest 
(here the range chosen was 50 MHz—36 GHz) are first converted to Y-parameters. After 
de-embedding the effect of the pad capacitances, the Y-parameters of the inner shell are 
converted to Z-parameters. Most of the elements are extracted from an analysis of the 
behaviour of the Z-parameters. Certain constraints are obtained to help in conditioning 
the optimisation procedure and to reduce the uncertainty. 

B. The base—collector capacitance Cs 

As indicated in Equation (3.66), the following approximation is valid in the intermediate 
frequency range: 


1 RviCrCoe 
Oe, OC Rig 


Im(Z22 — Z21) = joLe + j (3.29) 


At the low end of the intermediate frequency range, the second term is much greater than 
the other terms on the right side of the equation. Cs can be extracted by the equation: 

1 
wlm(Z22 — Z21) 
The extracted C, from Equation (3.30) at the bias values for Vog = {0.5 V, 1.0 V, 
2.0V, 4.0 V, 7.0 V} and Jp = 200 pA is shown in Figure 3.29. 

At Veg = 0.5 V, 1/@Cs ~ 1224 Q, which is much larger than the other terms in the 
intermediate frequency range if the values extracted below are used. The base—collector 


Cs = (3.30) 


Cbc + Cf versus frequency 
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The extracted value Cy, at different base collector voltages (B. Li, S. Prasad, L.-W. Yang and 
S.C. Wang, /EEE Transactions on Microwave Theory and Techniques, Vol. 46, No. 10, 
pp. 1427-1435, October 1998. ©1998 IEEE). 
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capacitance is observed to decrease as Vcg increases. This results from the increased 
width of the base—collector depletion region due to the increased Vcg. The deviation of 
Cs is less than 5% except when Vcg = 0.5 V. The base—collector junction is forward- 
biased at this value of Vcg when the intermediate frequency range moves up. 

The extrinsic base—collector capacitance is generally a weak function of the base— 
collector junction voltage. In extreme cases, it can be considered to be independent of 
the bias variation, or the ratio of the extrinsic capacitance to the total base—collector 
capacitance is considered to be a constant. Practically, the extrinsic capacitance Cf is 
the in-between case. For simplicity, the extrinsic capacitance is considered to be fixed 
and extracted from the values of C, at the different base—collector voltages. A method 
similar to that used for the extraction of the value of Cppe and Cpce is applied. The 
value of C, thus obtained can be compared with what is obtained from the cutoff mea- 
surement. The parameters for the base—collector junction capacitance are also extracted 
from this approach. 

C. The collector contact lead inductor L. 
The collector lead conductor L, can be calculated from Equation (3.29): 


1 
Lo = = | Im(Zas — Zy1) + : (3.31) 


wCs 
The third term in Equation (3.29) is assumed to be small enough and is neglected. This 
straightforward method is not as accurate as expected. The deviation of the extracted 
value of L, is large and an accurate value of L, is difficult to obtain. The reason for this 
is that the small error resulting from extracting C, could lead to large errors in L,. The 
differentiation of Equation (3.31) yields 


1 1 
AL. = —|—— | AC, 
: | =) ° 


: ae (3.32) 
=> -—,— X : 5 
wC, Cs 


Lg is very sensitive to even a 5% error in extracting Cs. The error in estimating L, result- 
ing from the error in the estimation of C, is plotted in Figure 3.30. L¢ is very sensitive 
to the error in extracting C,. However, L, is less sensitive to the error in estimating Cs 
if the magnitude of C, becomes larger. Therefore, a good bias point to extract L, would 
be zero bias at which the third term in the equation is negligible and the value of Cs is 
larger. The L, extracted at zero bias is shown in Figure 3.31. 

D. The base-collector resistance Rye 

The real part of Z22 — Z2, in the middle frequency range is given by 


1 RviCoeCe 
w RycCs G2 , 


Re(Z22 — Z21) = Re + (3.33) 


If the term 1/a@* RpcCs is much larger than the other two terms, Rp. can be approx- 
imately extracted from the real part of Z22 — Z2, in the lower middle-frequency 
range: 
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Fig. 3.30 The variation of Le with a 5% error in the estimation of Cs (B. Li, S. Prasad, L.-W. Yang and 
S.C. Wang, /EEE Transactions on Microwave Theory and Techniques, Vol. 46, No. 10, 
pp. 1427-1435, October 1998. ©1998 IEEE). 
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Fig. 3.31 The extracted Le versus frequency (B. Li, S. Prasad, L.-W. Yang and S. C. Wang, IEEE 
Transactions on Microwave Theory and Techniques, Vol. 46, No. 10, pp. 1427-1435, October 
1998. ©1998 IEEE). 
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1 
~ @2CsRe(Z22 — Za1) 


Rbe (3.34) 


The second term in Equation (3.33) is inversely proportional to w*; therefore, the mag- 
nitude of the second term decreases rapidly. The other two terms cannot be neglected as 
the frequency increases to a certain point. However, the extracted value of Rpc is not sig- 
nificant since the value of Rpe is very large and does not affect the frequency response 
much as long as we are only concerned with forward operation. Figure 3.32 shows the 
extracted Rpc at the bias Jp = 200A and Vcg = {0.5 V, 1.0 V, 2.0 V, 4.0 V, 7.0 V}. 
The magnitude of Rpe increases as Vcr increases. 

Figure 3.33 shows the extracted Rp¢ without de-embedding the pad capacitances at 
the bias Jp = 200A, Vcg = 1.0 V. The magnitude of the calculated Rpe is negative 
beyond | GHz. This shows that physically meaningless values may be obtained if no 
de-embedding procedure is carried out. 

E. The collector extrinsic resistance Re 

The R, could be extracted by plotting Re(Z22 — Z21) versus 1/w* in the high frequency 
range. The y-axis intercept is the value of R,. The requirement for the high-frequency 
range is difficult to be achieved and the conditions for the requirement are relaxed. Cs 
is bias-dependent and the larger value of Cs could be obtained from S-parameters at 
zero bias. Rc, should be extracted from zero bias by this method since the R, is more 
significant in Equation (3.23) at zero bias. 
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Fig. 3.32 The extracted Rp, at different base—collector voltages (B. Li, S. Prasad, L.-W. Yang and 
S.C. Wang, /JEEE Transactions on Microwave Theory and Techniques, Vol. 46, No. 10, 
pp. 1427-1435, October 1998. ©1998 IEEE). 
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The extracted Rpe at the bias Jz = 200 WA, Vog = 1.0 V without de-embedding the pad 
capacitances (B. Li, S. Prasad, L.-W. Yang and S. C. Wang, [EEE Transactions on Microwave 
Theory and Techniques, Vol. 46, No. 10, pp. 1427-1435, October 1998. ©1998 IEEE). 


F. The base contact lead inductor Ly 
Applying the first-order approximation, Lp can be easily extracted from the imaginary 
part of Z1; — Zj2 in the middle-frequency range. That is 


Ly = Im(Z11 — Z12)/o. (3.35) 


The extracted Ly at different biases is shown in Figure 3.34 without the de-embedding 
procedure. 

The dependence of the value of Ly on the bias Vcg is attributable to the pad capa- 
citance. After the de-embedding procedure is carried out, the extracted Lp is shown in 
Figure 3.35. 

The magnitude variation of Lp at the different biases is very small and almost 
negligible, so Lp can be considered to be independent of bias. 

G. The intrinsic and extrinsic base resistances 

In principle, the sum of the intrinsic and extrinsic base resistances, Rpx + Rpi, can be 
extracted from the low-frequency data, and the extrinsic base resistance, Rpx, can be 
extracted from the high-frequency data if the equivalent circuit shown in Figure 3.25 
describes the frequency response of the HBT accurately. However, most of the fre- 
quency data are located in the intermediate frequency range. The requirement for the 
high frequency condition is difficult to be satisfied, and the data at extremely low fre- 
quencies are not available. The constraints on the base resistances thus can be obtained 
from the real part of Z,; — Zj2 in the intermediate frequency range: 


Cp 
Im(Z1, — Z12) = Rox + Roi a 


(3.36) 
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Fig. 3.34 The Ly versus frequency, in which the pad capacitances have not been de-embedded (B. Li, 
S. Prasad, L.-W. Yang and S. C. Wang, [EEE Transactions on Microwave Theory and 
Techniques, Vol. 46, No. 10, pp. 1427-1435, October 1998. ©1998 TEEE). 
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Fig. 3.35 The Ly versus frequency, in which the pad capacitance effect has been de-embedded (B. Li, 
S. Prasad, L.-W. Yang and S. C. Wang, [EEE Transactions on Microwave Theory and 
Techniques, Vol. 46, No. 10, pp. 1427-1435, October 1998. ©1998 TEEE). 
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The values of Rpx + Rpji(Cpe/Cs) after de-embedding the pad capacitances (B. Li, S. Prasad, 
L.-W. Yang and S. C. Wang, [EEE Transactions on Microwave Theory and Techniques, Vol. 46, 
No. 10, pp. 1427-1435, October 1998. ©1998 IEEE). 


The pad capacitances have a significant effect on the value obtained for Rpx + 
Roi (Cbc/Cs). The maximum variations of Rpx + Rbpi(Cpc/Cs) before and after de- 
embedding the pad capacitances are 5 and 0.5 respectively. The result after removing 
the pad parasitics is shown in Figure 3.36. 

This value is used to constrain the optimisation procedure in order to obtain accurate 
values of Rpx and Rpj. The variation of Rpx + Rpi(Cbc/Cs) is due to the change of 
Cbc With the base—collector voltage Vcg. Cpc decreases as Vcg increases. This causes 
the ratio Cpc/(Cbc + Cp) to decrease and hence the magnitude of Rpx + Rpi(Crc/Cs) 
decreases. 

H. The emitter resistance Ry and base-emitter resistance re 

Re + re can be obtained from the real part of Z;2 in the intermediate frequency range. 
With the high collector current where the neutral base recombination is the dominant 
recombination, Rg + re can be expressed as 


ng kT 


Ret+re = Re + : 
qle 


(3.37) 


The real part of Z)2 in the intermediate frequency range is the sum of re + Rg. The plot 
of re + Re versus 1/Jp would give the values of Rg, re and ideality factor n¢. 
I. The emitter lead inductor and base-emitter capacitance 

=C bets can be obtained from the imaginary part of Z12 in the low middle frequency 
range. In the case of high collector currents, the fraction of the depletion capacitance 
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in the base-emitter capacitance Cpe is small and Cpe can be approximated to be pro- 
portional to Jp, and we also have re « 1/Jg. Therefore, the y intercept of the plot of 
Le - #2 Che versus |/Ig gives the value of Le. The value of Le — Coar? at f = 5 GHz 
is used for this purpose. Le — Cure is plotted versus 1/J, and shown in Figure 3.37. 
Based on the values of Le and re obtained previously, the value of Cpe can be easily 
calculated. The value of Cpe obtained in this way only serves to give the initial value 
of Che. An accurate value of Che is obtained from the optimisation procedure. It is 
noted that the magnitude of Che is not sensitive to the optimisation procedure. This was 
also reported in [24] where the value of Che is calculated from fy (where fy is the 
transport factor a cutoff frequency and fy = 1/reCpe). An accurate value of Che is 
extremely difficult to obtain since changing the value of Cpe does not change the error 
of optimisation much over the bias ranges for this example. 
J. The transport factor a 
The transport factor w can be calculated directly by 


Z21 -Z 
= aa en (3.38) 
Z22 — Z21 — Re — joLe 
Assuming a single-pole approximation, one can write 
a : 
a= —_e i, (3.39) 
ine 
Da 


0.25 4 


T/IE (1/A) 
Le- Care versus the inverse emitter current. Le = 9.99 pH (B. Li, S. Prasad, L.-W. Yang and 


S.C. Wang, /EEE Transactions on Microwave Theory and Techniques, Vol. 46, No. 10, 
pp. 1427-1435, October 1998. ©1998 IEEE). 
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where Ww, can be expressed as 


1 1 
= = (3.40) 
Coele Th 


Wa 


where Ty is the base transit time and is related to physical parameters by Wec/ 2D, for 
npn HBTs. The magnitude of a(w) at Jg = 200 WA with different collector—emitter 
voltages is shown in Figure 3.38 together with the fitted curve of the magnitude of a. 

ag is obtained by taking the value of || at low frequency and w, (and therefore the 
base transit time Tt, = 1/m@,) can be calculated directly at each frequency using 


a — la(w)|2 
Th = ———_. (3.41) 


wla(w)| 


The calculated tp at Jj = 200A with different collector-emitter voltages is shown 
in Figure 3.39. Since the base is heavily doped, the base width modulation effect in 
the HBT is negligible and therefore t, should be a weak function of Jp and Vcg. The 
dependence of tp on the base current and collector—emitter voltage is not completely 
clear. One possible reason for the a dependence on Vcg is the self-heating effect in 
the HBT. The diffusion coefficient Dy = (KT /q) [Mn is a function of the temperature in 
which “4y « TS. An often quoted value of s is 2.3 (for intrinsic GaAs). It is noted 
that the D, decreases when the dissipated power in the HBTs increases. Therefore, 
the tp, increases with larger Vcg values. The emitter—collector phase delay time can be 
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Fig. 3.38 |a| versus frequency at different collector emitter voltages (B. Li, S. Prasad, L.-W. Yang and 


S.C. Wang, /EEE Transactions on Microwave Theory and Techniques, Vol. 46, No. 10, 
pp. 1427-1435, October 1998. ©1998 IEEE). 
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Tp versus frequency at different collector—emitter voltages (B. Li, S. Prasad, L.-W. Yang and 
S.C. Wang, JEEE Transactions on Microwave Theory and Techniques, Vol. 46, No. 10, 
pp. 1427-1435, October 1998. ©1998 IEEE). 


eee La(w) — tan7! ( . )I. (3.42) 
(a) Wa 


where tT = (m/1.2)t + T. andm ~ 0.22. The calculated t versus frequency at Jp = 
200 WA with different values of Vcg is shown in Figure 3.40. 

When the collector—emitter voltage increases, the collector transit time t. = 
Wec/2vsat increases due to the larger base—collector space region. Therefore, the 
emitter—collector delay time increases as expected with the larger collector—emitter volt- 
age. This might be explained by the self-heating effect in the HBT. The emitter—collector 
delay time is a monotonously increasing function of tp and t,. As the power dissipated 
in the HBT increases, the temperature of the intrinsic part of the HBT increases. That 
causes the diffusion coefficient Dy to decrease and results in the larger base transit time 
Tp, and hence the larger emitter—collector delay time. 


Results of the parameter extraction 


The values of the bias-independent elements are given in Table 3.4. 

All of the bias-independent elements are extracted from the procedure described 
above except for Le, Rpj and Rpx. Accurate values of Le, Rpx and Rp; are obtained from 
the empirical optimisation procedure. Let a = Rpx + Rpi(Cpc/Cs) and y = Cpc/Cs. 
The initial values of Rpj and Rpx are estimated from the variation of a. We have 
Rox = Aa/Ay. The calculated values of Rpx and Rpj are listed in Table 3.4. Instead of 
defining just the absolute error and just the relative error, the mixed relative and absolute 
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Table 3.4 The bias-independent parameters 


Parameters Values (analytical) Values (optimised) 


Cpbe fF) 0 0 
Cpbe (fF) 27.4 27.4 
Cpce(fF) 41 42 
C;(fF) 16.5 16.5 
Lp(pH) 55 55 
Le(pH) 9.9 5.46 
L¢(pH) 61 61 
Rpx(Q) 1.38 1.42 
Rpj (Q) 2.3 4.049 
RR(Q2) 1.832 1.832 
Re(Q) 4.99 4.99 
Error 2.2 % 0.43 % 
-12 
4.5 7 10 . é : : 
4h 
IB=200 UA | : 
| a a TT rs ie ‘ 
3i o: VCE=0.5V 

x: VCE=1.0V 

+: VCE=2.0V 
ao *: VCE=4.0V 
Zz 9 --: VCE=7.0V 


0 5 10 15 20 25 30 35 40 
Frequency (GHz) 


T versus frequency at different collector—-emitter voltage (B. Li, S. Prasad, L.-W. Yang and 
S. C. Wang, IEEE Transactions on Microwave Theory and Techniques, Vol. 46, No. 10, 
pp. 1427-1435, October 1998. ©1998 IEEE). 


errors are used to obtain the best fit between the measurement and the simulation. The 
optimisation error is defined by 


N 3 2 e 2 
sme — saim\" {sme — sim 
E= 1/4N » ( gmea or mea 
i=] 2 
sme — sgim\" (sues — ssim'\* 
+ sme + sme ; (3.43) 
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Table 3.5 The bias-dependent parameters 


Vck(V) 0.50000 1.0000 2.0000 4.0000 7.0000 


Ic(mA) 6.0 6.2 6.1 5.9 5.6 
Che (fF) 109.1 64.75 41.50 26.8 18.70 
re(Q) 4.168 4.0432 4.218 4.518 5.018 
Rbe(kQ) 8.6 70 121 180 200 
ay 0.9741 0.9751 0.9751 0.9740 0.9740 
CA CF) 344 344 344 344 344 
f(GHz) 63 66 66 55 45 

tA (ps) 0.42 0.95 1.7 2.52 3.42 
Co a) 289 238 239 268 293 
f2(GHz) 58.2 58.9 52.3 47.10 39.9 
t°(ps) 0.158 0.69 1.09 2.08 3.14 
Error® 0.98% 0.84% 0.86% 0.62% 0.9% 


Error? 0.50% 0.69% 0.43% 0.41% 0.56% 


where N is the number of the frequency points. The errors between the measured and 
simulated S-parameters are also listed in Table 3.4. 

The optimisation is carried out at the bias Jpg =200WA and Vcp=2.0V. The 
error between the measurement and the simulation at the bias Jj =200)A and 
VcrE=2.0V before the optimisation is already 2.2%. The bias-dependent param- 
eters Che, le, Coc, Roc, &0, fy and t at constant base current Jp =200WA and 
VcgE={0.5V,1.0V,2.0V,4.0V, 7.0 V} are given in Table 3.5. Superscript A rep- 
resents the results from optimisation and superscript O represents the results from 
the direct analysis. Once the values of the bias-independent elements are known, 
all the bias-dependent values can be easily calculated and no further optimisation is 
needed. It is seen in Table 3.5 that, by using the directly calculated values of the 
bias-dependent elements, the error between simulation and measurement is very small. 
All the errors are less than 1%. Optimisations are also used. Only the three elements 
Coe, fo and t are optimised. The errors after optimisation are given in the Table 3.5. 
The variation of re is dependent on the collector current and the self-heating effect. 
As explained previously, accurate values of Cpe are very difficult to obtain. The varia- 
tion of Cpe may result from the numerical techniques. The bias-dependent parameters 
Coe; Te; Coc, Roc, HO, fo and t at constant collector—-emitter voltage Vcg = 2.0 V and 
Tp = {200 pA, 400 tA, 600 WA, 800 LA, 1000 LA} are given in Table 3.6. The errors 
in using the analytical approach and in using the optimisation procedure based on the 
initial values obtained from the analytical approach are both given. It is seen that the 
errors, using the analytical approach, become higher if the collector currents increase. 
This is because the self-heating effect becomes more significant when the collector 
currents increase. However, the bias-independent elements are forced to be fixed in 
all the extraction procedures and they are practically functions of the device tempera- 
ture. The thermal effect is absorbed by the bias-dependent elements after optimisation. 
Thus the errors become smaller. As expected, Cpe increases with increased collector 
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Table 3.6 The bias-dependent parameters 


Tp (wA) 200.000 400.000 600.000 800.000 1000.000 


Ic(mA) 6.1 14.1 22.6 30.19 38.8 
Che (fF) 41.50 37.4 32.7 28.75 27.15 
re(Q) 4.218 1.798 1.133 0.8482 0.6912 
Rpe(kQ) 200 200 200 200 200 
ag 0.9751 0.9785 0.9796 0.9790 0.9790 
CCF) 344 863 1275 1704 2190 
fi(GHz) 66 72 94 114 102 
t(ps) L7 1.6 1.52 1.73 1.2 
CP. (fF) 0.239 0.682 1.070 i537 1.984 
fO(GHz) 52.3 64.4 79.1 88.9 81.2 
t°(ps) 1.09 1.02 0.98 0.68 0.45 
Error® 0.86% 2.6% 3.2% 5.2% 7.6% 
Error? 0.43% 0.62% 1.1% 2.1% 2.9% 


The simulated and measured S$}; and S97. o: measured Sj; 7: simulated S;;; V: measured Sy); 
A: simulated S79 (B. Li, S. Prasad, L.-W. Yang and S. C. Wang, [EEE Transactions on 
Microwave Theory and Techniques, Vol. 46, No. 10, pp. 1427-1435, October 1998. ©1998 
IEEE). 


currents. It is observed from Table 3.6 that Cp. decreases with the increased collec- 
tor currents. One possible explanation is the self-heating effect and the modification 
of the base-collector space charge region by the injected carriers [24]. The effect of 
self-heating is not discussed here. The simulated and measured S-parameters at the bias 
Tp = 200 WA and Vcr = 2.0 V are shown in Figures 3.41—3.43. The excellent fit between 
the measured and modelled data shows that this procedure may be used successfully in 
device parameter extraction. 
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90° 20.0 


-90° 
Frequency 0.41 to 36.0 GHz 


Fig. 3.42 The simulated and measured $1.0: measured $21; ;7: simulated $2; (B. Li, S. Prasad, 
L.-W. Yang and S. C. Wang, JEEE Transactions on Microwave Theory and Techniques, Vol. 46, 
No. 10, pp. 1427-1435, October 1998. ©1998 IEEE). 


Fig. 3.43 The simulated and measured $7. 0: measured $17; 7: simulated S19 (B. Li, S. Prasad, 
L.-W. Yang and S. C. Wang, JEEE Transactions on Microwave Theory and Techniques, Vol. 46, 
No. 10, pp. 1427-1435, October 1998. ©1998 IEEE). 
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In summary, the pad capacitances are extracted from the HBTs under cutoff oper- 
ation. Most of the elements are obtained from the analysis of the behaviour of the 
Z-parameters. The values of uncertain elements are obtained from the optimisation at a 
specific bias. The initial values of these uncertain elements are also obtained from the 
analytical approach. 


Basic expressions for small-signal parameter extraction 


Small-signal equivalent circuit parameter extraction has been much addressed by several 
researchers [3, 6, 19, 20, 25, 26, 33, 34]. The basic expressions and approximations for 
the Z-parameters in different frequency ranges and under different bias conditions are 
treated in this section [14]. 


Theoretical approximations of Z-parameters for the HBT 


The simplified T equivalent circuit of the HBT after de-embedding pad capacitances is 
shown in Figure 3.44. The T equivalent circuit is used here since it is more physically 
meaningful than the a equivalent circuit. The latter circuit is the mathematical repre- 
sentation of transistor operation. It is also easier to infer the large-signal model from the 


Cc 
IT 
R 
be 
C.. 
= Ly Rox Roi © Rg Rex L, < Q 
. al, 
le i) Cre 
le Re 
V 
7 
e 
E 


Simplified T-type equivalent circuit, in which pad parasitics have been de-embedded (B. Li and 
S. Prasad, JEEE Transactions on Microwave Theory and Techniques, Vol. 47, No. 5, 
pp. 534-539, May 1999. ©1999 TREE). 
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Collector 


SI GaAs substrate 


Physical significance of the elements in the small-signal equivalent circuit (B. Li and S. Prasad, 
IEEE Transactions on Microwave Theory and Techniques, Vol. 47, No. 5, pp. 534-539, May 
1999. ©1999 IEEE). 


bias-dependent element parameters by the T equivalent circuit since there are straight- 
forward physical equations for the bias-dependent intrinsic elements. Figure 3.45 
depicts the physical significance of each of the circuit elements in Figure 3.44 and also 
includes the extrinsic parasitic capacitances Cpbe, Cpbe and Cpce. The active portion of 
the HBT was modelled using Che, Coc, re, Ale, Roc and Cs. Rg is the extrinsic emitter 
resistance which consists of the contact resistance and emitter region resistance. The 
extrinsic collector resistance was divided into three parts: Ro1, Rez and R3, which are, 
respectively, the resistance due to the n-collector, the n* access region, and the collector 
contact. The intrinsic collector resistance is represented by R,; which characterises the 
distribution effect of the base—collector junction at the collector side. Rej, Rez and R.3 
are lumped together as R,x in Figure 3.44. Similarly, the extrinsic base resistance con- 
sists of a contact resistance Rp; and an access resistance Rp2. Rp; and Rp are lumped 
together as Rpx in Figure 3.44. Rpj is the intrinsic base resistance. Finally, the distribu- 
tion effect of the base—collector junction is modelled by the elements Rpi, Rox, Ce, Coc 
and Rpc. Cpbe, Cpbe and Cpce modelled the coupling between the base-emitter, the base— 
collector and the collector—emitter interconnection layers. Le, Ly and L, are the contact 
leads of the emitter, the base and the collector respectively: 


Zpc = oe (3.44) 
1+ j@RvcCoe ; 
: le 
ZE=R L —_____—. 3.45 
E E+ jo $* Ta yarn (3.45) 
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(3.46) 


(3.47) 
(3.48) 


(3.49) 


The two-port network Z-parameters are given as follows (the derivation for two-port 
Z-parameters of this equivalent circuit is straightforward using basic circuit theory and 


the details are not given here): 


RoilZp + Rei + 1 — )Zpc] 


Z\1, = Zp+ZE+ 


A 
Rg l—a)Z 
Fp Fed Ray ot ) BC 
Zo = Zn + Rai Rpi + Ud — a)ZpcRpi — @ZRZBC 
A 
l—a)Z Rgl(Z Roi 
ye ee ay ce [( a)ZBC ~: cil ( F+ bi) 


where A = Rpi + Rei + Ze + Zpc. 


(3.50) 


Let Rj = 0 (in most cases, it is hard to distinguish between Rj and Rox, and Rei and 
Rex are lumped together as R,), the following equations are obtained after some simple 


calculations: 


ZF Roi 
Zgc + Zp + Roi 
(1 — a)ZpcRpi 
Zgc + Ze + Roi 
_  &ZpZpc 
~ Zao t+ Ze + Roi 
ZpZpBc 
Zpc + Zp+ Ry 


Z1, — Z12 = ZB t+ 


Zi2 = ZeE+ 


Z\2 — Z21 


Zy2 — Z213 = Zo + 


The most important term is 


ZeZBC 


2 = ——__.. 
Zpc + Zp + Roi 


(3.51) 


(3.52) 


(3.53) 


(3.54) 


(3.55) 


This term ‘¢ is a complicated function of frequency and all the right sides of the Equa- 
tions (3.51)—(3.54) can be expressed as the sum of a simple function of frequency and 


this term (except for a scaling constant). 
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Within the extreme low frequency range in which wCpcRpe < 1, 


ZeZBC 


Zpc + Zr 
Roe 


— fo(Ce + Coc) Roe + 1 
~ Roc — joR?,Cs, (3.56) 


Rx 


where we have used the assumption Rpce >> Rpj and let Cs = Cr + Che, since this 
is almost always the case in the forward or saturation applications. A typical extreme 
low frequency can be calculated by using the typical element values. In modern pro- 
cess technology in which the base resistance and base—collector capacitance have been 
greatly reduced, the typical values Cpe ~ 5 x 107!4 F and Rye ~ 5 x 10*Q, therefore 
the frequency f ~ 0.1/(27 RpcCpc) = 6.8 MHz. For microwave applications, this typi- 
cal frequency is very low. Substituting the expression in Equation (3.56) into Equations 
(3.51), (3.52) and (3.54), the equations reduce to 


Zit — Z12 = Rox + foLy + Roi — JOR RoiCs (3.57) 
Zio = Ret joLe +re — jor2Cpe 

+ (1—«) x joRpiCr(Roc — joR?,Cs) (3.58) 

Zog — Zn, = Ro + fol, + Ryo — jwR2,Cy. (3.59) 


The base-emitter resistance is small in forward bias and therefore, Z_ is approximated 
as Re + jole+re — jor? Che in this analysis. Z)2 can be simplified by discarding the 
higher-order terms: 


Zin = Ret joLle tre — jorgCre + j(1 — @)@RpiRocCs. (3.60) 
At low frequency, a > apc — 1. Hence, Z;2 is further simplified as 
Z12 = Ret joLe + re — jor2Cre- (3.61) 


The intermediate frequency range is characterised by WCpcRpi, @CeRpi <« 1, but 
@CbcRbc >> 1. The approximation for %t is given as 


vt = ZEZpBc 
‘ Zp + Zpc + Roi 
ZRZ Roi 
ins OEE oy (1 = ) (3.62) 
Zp + ZBc Zp + ZpBc 
7 1 i 1 1 JOC Coe Roi 
a Jos w*C?2 Roe Cs 
ae a 1 RviCocCe — f RoiCrCoe 
~ j@Cs w*C?2 Roe C oC? Roc 
1 1 RviCocCe 
~- + ; 3.63 
JoCs w*C?2 Roe Cc ( ) 
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where higher-order terms have been discarded. A typical intermediate frequency is cal- 
culated by using the typical element values. Assume Rpi = 10, the typical value 
of frequency is 0.1/(27 RpiCbc) & 0.64GHz. The value wCpcRpbce & 10 appropri- 
ately justifies the assumption. Substituting the approximation in Equation (3.63) into 
Equations (3.51), (3.52) and (3.54), we obtain 


Coc f RoiCe 


Z11 — Zj2 = Rox + JoLy + Roi 
11 12 bx + J@Lb bi CG aR 


(3.64) 


Cc 
Zi. = Ret joLle + re — joreCye + (1 — a) Rbi (=) (3.65) 
Ss 


1 1 RuiCocC. 
Zan — Za = Re + jole + = = (3.66) 


wCs + w RpeCg CG 
As indicated in the above expressions, initially 1 / w (Cr +C be)? Roc is much larger than 
RoiCpcC¢/C2 and R.. Re(Z22 — Z21), where Re denotes the real part can hence be 
used to extract Rpc. With increase of frequency, the second term decreases rapidly and 
Re(Z22 — Z21) is possible to be negative as described in [20], where the intrinsic base 
resistance Rp; is much higher in InP-based HBTs than in GaAs-based HBTs. After the 
higher-order terms are discarded, Equations (3.64) and (3.65) are reduced to 


; Cb 

Zit — Z12 = Rox + JOLy + Roi a (3.67) 
Ss 

Zi2 = Ret joLe +re— jor2Cbe (3.68) 


The fifth term in Equation (3.65) consists of only a very small fraction of the real 
part of Z 2; therefore, the approximation is generally reasonable. However, for InP- 
based HBTs it may account for up to 18% of the real part of Z)2 due to the large 
base resistance [20]. Self-consistent iterations can be used to correct the approximation 
errors [20]. 

The very high frequency range is defined by wCpc Rpi, @CERvi >> 1 


ZEZBC 1 


Roi : Zp + ZBc 
bi 


we 


Ie 


7 ( Zet+ — 
wo CysCeRhi Rpi 


il 1 1 1 
++ - +- (3.69) 
w*RyiCpcCp — @ RoiCocCe (com ser) 


The typical very high frequency is calculated by the values given above: 
f = 10/(2z RpiCpi) = 63.8 GHz. Since the very high frequency is very hard to reach, we 
normally relax the high frequency requirement to above 20 GHz. Substituting Equation 
(3.69) into Equations (3.51), (3.52) and (3.54), we obtain 


. 1 1 Coc 
Zi, —-Z12 =R iL 3.70 
i — Z12 = Rox + jo bt Gh WRuCaG (<s,) (3.70) 
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l-a l-a € 
i = Rade het tie Oe LS = ( ) (= +1) 


JOC ~ OP ReiCecCe Cre 
(3.71) 
Z Z Rcot+ joL ! + : ( : + : ) 
22 — Zo21 = Ro+ jw 
© @ RoiCocCe — @ RviCreCe \ FOC Roi JOC Roi 


(3.72) 


We divide the frequency range into three subdivisions. The boundaries of these three 
ranges are not strict and highly dependent on the applied biases, process technology 
and device design. The magnitudes of Rbc, Rpi, Cr, Cpc and re affect the definitions of 
the frequency range. The network analyser is used to measure S-parameters and then the 
S-parameters are converted to Z-parameters. The lowest frequency of the network anal- 
yser is about several tenths of MHz. Therefore, S-parameter data at the low frequency 
(satisfied by the condition wRpcCs « 1) are not of much use for analysis except in the 
reverse or deep saturation region. 

For the analytical approach to parameter extraction, we clarify that the intermediate 
frequency range is below 5 GHz and the high frequency range is greater than 10 GHz. 
The method of clarifying the frequency range before extracting the element parame- 
ters is arbitrary; however, the justification needs to be made during the procedure of 
parameter extraction. 


Z-Parameters at zero bias or ‘cold’ biases 


At zero bias (Vgg = OV and Vcg = OV) or ‘cold’ bias (Vgg = OV and Jp = OA; 
‘cold’ is adopted from the MESFET modelling techniques), the current gain becomes 
zero and the device behaves like a passive component. The base-emitter resistance re is 
also much larger. The magnitude of this resistance is normally greater than 10+ Q, which 
is within the same order of magnitude as the collector—base resistance. Therefore, the 
equivalent circuit becomes much simpler and so are the Z-parameters. With a = 0, 


we have 
fate 2 ke (3.73) 
Zpc + Ze + Roi : 
ZpcRpi 
Zo Za = Ze —_ 3.74 
12 21 E+ 704 Zp + Ru (3.74) 
ZpEZBC 


Z22 — Z23 = Zo+ (3.75) 


Zec + Zr t+ Roi 
The approximation of the Z-parameters is almost the same as before except for Z,2 and 
Z21. The approximation of Zg in this case will be rather different. The Zp at the low 
frequency range in which wreCpep < | is satisfied is given by 


Zp ~ Ret jole+re— jor2Cpeo. (3.76) 
Substituting into Equation (3.74), Z12 can be written as 


Zio ~ Ret joLle + re — jor2Cpeo + joRviCt(Roe — j@RZ,Cs). (3.77) 
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For simplicity, higher-order terms can be discarded in the above equation without 
affecting the accuracy. 
When the frequency increases to the point at which wreCpeo >> 1, ZE 18 approxi- 
mated as 
1 


Zp = Rp+ jole+- + : 
o— JOCve0 wre, 


(3.78) 


Actually, the assumption of wreCpeo, @RpcCoco >> 1 and @CpeoRvi, OCrco Roi K | is 
easily satisfied in the frequency range of interest; therefore, 


; Cp 
Zit — Z12 = Rox + fjoLy + Rei (3.79) 
Ss 
Fie = Ra Fk i ge (3.80) 
= (a) 5 ‘ 
12 E+ JoLe joGe, eee G bi 
1 1 
22. — Z23 = Ro t+ JolLe + + ; 3.81 
22 21 ct+yole Toe,  ORaee ( ) 


In the above equations, the higher-order terms have been discarded for simplicity. If the 
frequency increases further and 1/(@Cpe0), 1/(@Cbco) K Roi, then Z 12 is given by 


1 1 1 1 1 1 
Z12 = RE+ jole + Foes 2. wreC, oy Toc + Ree (— + =) : 
(3.82) 
In the case of both junctions being reverse-biased, the approximation of the Z- 
parameters might be rather similar to this case at zero bias. 


Parameter extraction 


The model parameter extractions are based on the assumption that all the extrinsic and 
parasitic elements are bias-independent and only elements in the intrinsic part of the 
device vary with bias. This also means that the extrinsic and parasitic elements can be 
extracted by using multiple bias information. Some element values are sensitive to the 
bias point at which the extraction procedure is carried out. 

Since the conditions at the intermediate frequency are easily satisfied in the fre- 
quency measurement range, we should make the most use of the Z-parameters in the 
intermediate frequency range. The extraction procedure is shown in Figures 3.46-3.49. 
The information for extracting the element parameters is overwhelming; hence different 
schemes can be developed. The extraction procedure shown here is just one of them. The 
superscripts used are explained as follows: R represents the real part of the correspond- 
ing Z-parameters; I the imaginary part; F the forward bias; 0 the zero bias; L the low 
frequency range; M the intermediate frequency range; and H the high frequency range. 

From Z1; — Z12, we extract Lp by plotting Im(Z1; — Z12)/@ versus frequency in 
high frequency range. Rpx + Rpi can be obtained from Re(Z1; — Z12) at the extreme 
low frequency. Rpx might be obtained from Re(Z1; — Z12) at the high frequency. The 
difference will be Rpj. 
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Ly (Z44-Z 2)!" @ Rox + pi <— (Z44-Z a) Rig (Z44-Z12) 4 


The element values extracted from Z,, — Z 12 (B. Liand S. Prasad, IEEE Transactions on 
Microwave Theory and Techniques, Vol. 47, No. 5, pp. 534-539, May 1999. ©1999 TEEE). 


Le-teChe<— Z1 FM Retigeee ze" 


Le, Che = Lor? Cg 1/1 Re, fe Me = Re+re~ 1/1¢ 


The element values extracted from Z1 (B. Li and S. Prasad, IEEE Transactions on Microwave 
Theory and Techniques, Vol. 47, No. 5, pp. 534-539, May 1999. ©1999 TREE). 


(Z42-Zq)/(Z2- Zo - Re -jw Le) 


V 


OLDer Thr Ter T fo, 


The element values extracted from (Z12 — Z21)/(Z22 — Z21 — Re — jwLc) (B. Li and 
S. Prasad, JEEE Transactions on Microwave Theory and Techniques, Vol. 47, No. 5, 
pp. 534-539, May 1999. ©1999 IEEE). 


Figure 3.47 shows how to extract Rg, re, Le and Cpe. Re +e is easily obtained from 
the real part of Z2 in the low or intermediate frequency range and we have 


nekT 
Re+re= Ret : (3.83) 
qle 


Therefore, we plot the curve Rg + re versus 1/Jp. Re, re andn¢ should be easily 
extracted. In the case of high collector currents, the fraction of the depletion capacitance 
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Ryo (Zo2-Zai)PM x «? (Cye + Cy) Re <= (Zy9-Zy,)PFH ~ 1/@ 


Z52-Zo4 


Cue + Cy = (Zp9-Zyq)'F Mx (-) Le ((Zp9—Zoq)hF™ 41/ @ (Cy, + Cp))/ 


C,,, C;< fitting pn capacitance equation 


The element values extracted from Z27 — Z, (B. Li and S. Prasad, IEEE Transactions on 
Microwave Theory and Techniques, Vol. 47, No. 5, pp. 534-539, May 1999. ©1999 TREE). 


in the base-emitter capacitance Cpe is small and Cpe can be approximated to be propor- 
tional to Jg, and we also have re « 1/Jg. Therefore, the y intercept of plot of Le — re Coe 
versus 1/Jg gives the value of Le. Once Le is known, Che can be easily calculated. 

The extraction from Z22 — Z21 is somewhat more complex. Rpe is extracted from the 
expression Re(Z22 — Z21)w Cs at the intermediate frequency or from the real part of 
Z22 — Z2) at the low frequency. Rg is the vertical axis intercept of the plot of Re(Z22 — 
Z21) versus 1/ w~ in the high frequency range. Ce + Cpe is extracted from (Z22 — Z21) x 
(—q) at the intermediate frequency. There are two different assumptions about Cr. One 
assumes that Cr is bias-independent. Therefore, we can obtain C¢ and C}, by fitting 
Cr + Cpe into the following bias-dependent equation: 


(1 VBc ) Mijpe ° 
Vibe 


On the other hand, Cs might be considered to vary with the bias rather than be fixed 
as in the Gummel—Poon model. The fraction of Cy in the base—collector capacitance is 
constant. For this case, given Rpx, Rpi and Cpc/Cs can be obtained from the real part of 
Z\1—Z}2 at the intermediate frequency or calculated from a knowledge of the geometry. 
We can calculate Cp, and Cf once the fraction is given. 

Lg is calculated by the expression Im[(Z22 — Z21) + 1/(@(Coc + C))]/@ in the high 
frequency range. 

a can be extracted easily once we know R, and Le. This is shown in Figure 3.47. The 
extraction of parameters related to a has been reported by Pehlke and Pavlidis [19]. 


Cet Coc = Cr+ (3.84) 
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The approximation at Rp; = 0 


This is the case in which the distributed effect of the base—collector region is not neces- 
sarily taken into account. The equivalent circuit is reduced and so are the Z-parameters. 
Let Rpi = 0 in Equations (3.51)—(3.54), we have 


Zi, — Z12 = Ryo + joLy (3.85) 

Z12 = Ret joLle + re — joreCre (3.86) 
Z\2 — Zx1 =a x Zpc (3.87) 
Zn. — Zo, = Ro+ jal. + Zp. (3.88) 


The element parameter extraction becomes trivial in this particular situation. 
In the extreme low frequency range, 


Z2 — Z21 = Ro + joLle + Roe — J RpeCre- (3.89) 


In the intermediate frequency range, 


1 


Zo2 — Z23 = Ro t+ jol. + = + ‘ 
ee joCbec wo RpcCp, 


(3.90) 


Rp and Lp are given by Equation (3.85). Equation (3.86) gives RE, re, Le and Cte. 
Equation (3.88) gives Re, Rpc, Cpc and Le. The extraction of a is the same as described 
above. 

The T small-signal equivalent circuit for HBTs after de-embedding pad capacitances 
has been used to derive the Z-parameter expressions. The simplification of the Z- 
parameters based on the range of measured frequencies was developed in order to 
provide guidelines for directly extracting element parameters. The information from the 
measured S-parameters is normally overwhelming and different schemes of extraction 
procedure could be developed from these approximations. One fully analytical extrac- 
tion procedure has been provided here. Z-parameters at multiple biases are utilised to 
extract the small-signal parameters. These results can also be used for the parameter 
extractions of BJTs. 


Small-signal model of the collector-up (inverted) HBT 


The device used in this example is a 5 x 10j1m? InGaAs/InAlAs/InP inverted HBT 
with fr = 23GHz and fmax = 20GHz. The small-signal equivalent circuit of the 
device [17, 18] is shown in Figure 3.50. 

Lp, Le and Le and Cpbe, Cpbc and Cpce are parasitic inductances and capacitances 
respectively, and Rp, R, and Re are extrinsic resistances. The active portion of the 
HBT is modelled by intrinsic elements Ce, re, Cie, @ and Rice, where a = ap/[1 + 
i(f/fu)le@". of is the de value of the transport factor, t is the transit time of collector 
current and fy is the a 3 dB frequency. RF-measurements indicate that the HBT under 
zero bias (Ih = 0A, Vcg = 0 V) can be represented by a passive network. Therefore, in 
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= Cobe r == = Crce 


E 


Small-signal equivalent circuit of the inverted HBT (B. Li and S. Prasad, JEEE Transactions on 
Microwave Theory and Techniques, Vol. 45, No. 7, pp. 1135-1137, July 1997. ©1997 TEEE). 


this case, the transport factor a is negligible. Moreover, the dynamic resistance of the 
base—collector p—n junction and the base-emitter p—n junction is assumed to be very 
large. The uncertainty of numerical optimisation in zero bias can be reduced. The initial 
values that are assumed in optimisation are calculated from the physical and geomet- 
rical parameters. The parasitic elements, Cpbe, Cpbe, Cpce, Lb, Le and Le, are obtained 
from zero bias numerical optimisation and assumed to be invariant with bias [18]. Their 
values are listed in Table 1. 

The other elements under non-zero bias are extracted by the following analytical 
approach: 


e Convert the S-parameters to Z-parameters and remove the parasitic series elements 
Ly and Le; 

e Convert the Z-parameters to Y-parameters and remove the parasitic shunt elements 
Cpbe: Cpce and Cpbe: 

e Convert the Y-parameters to h-parameters. 


The elements of the equivalent circuit, excluding the parasitic effects, are easily 
extracted using the procedure described in [19]. 

The elements Rp, Le, Re and Re are basically constant over the entire frequency 
range of interest and do not show significant variation with bias. Therefore, 
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these elements can be considered to be fixed. The bias-dependent elements are 
Ce, Cie, Ric, dF, T, fu and re. The consideration of the bias variation of these elements 
is sufficient for accurate small-signal modelling [17, 18]. 


Problems 


(1) 


(2) 


(3) 


(4) 


The scattering parameters of the HEMT in the common source configuration 
are given. The Y-parameters are related to the equivalent circuit parameters as 
follows: 


Yi + Yi2 = joCgs (3.91) 

—Yi2 = joCep (3.92) 
Y22 + Yi2 = gp + j@oCps (3.93) 
Y21-Yi2 = a (3.94) 


where 
=( 
= (3.95) 
&D 
Ln(Vcos — Vr) 
op = gs (3.96) 


The scattering parameters were measured at Vos — Vr = 0.4 V. The mobility 
Ln = 4400 cm?/V-s and L = 1 um. Vps = 0.5 V. gm = 0.049S, gp = 0.014S. 


Use the SA algorithm to obtain optimised values of the equivalent circuit after 
converting the scattering parameters to admittance parameters [14] as shown in 
Table 3.7. 


What are the advantages of combinatorial optimisation methods over analytical 
methods of device optimisation? What are the disadvantages? 


A researcher wants to optimise the parameters of a small-signal equivalent circuit 

for the HBT using the genetic algorithm. The parameters are shown in Table 3.8. 

Assume that there is a procedure for measuring the S-parameters of the HBT for 

each variation of the parameters of the circuit shown in the table. 

(a) Write down the steps of a genetic algorithm for the optimisation. 

(b) Write a fitness function for the genetic algorithm. 

(c) Modify this genetic algorithm for optimising the small-signal parameters of 
a FET. Compare the difficulty of doing this to the difficulty of modifying an 
analytic method for the same procedure. 

A student wants to create a neural network model for a power amplifier with 

the parameters shown in Table 3.9. He/she decides to use the SGA to simultane- 

ously optimise both the weight values and the number of neurons in the neural 
network. 


More free ebooks : http://fast-file.blogspot.com 


Optimisation and parameter extraction of circuit models 


Table 3.7 HEMT scattering parameters 


Frequency (GHz) Si S12 S21 S22 

5 0.552 — 158 0.05 21.80 0.53214.5 0.832176 
10 0.752 — 166 0.062180 0.63218.5 0.822173 
20 0.82179 0.07226 0.68 228.5 0.822167 
30 0.8262 169 0.07235 0.079 233.5 0.832161 
40 0.792161 0.009233 0.09 234.5 0.812155 


Table 3.8 Circuit parameters 


Component Definition 

Lp(pH) Base inductance 
L¢(pH) Collector inductance 
Le(pH) Emitter inductance 
Cc (fF) Collector capacitance 
Ce(pF) Emitter capacitance 
Rp (2) Base resistance 
R_(Q) Collector resistance 
T(ps) Transit time 

Bo Current gain 


Table 3.9 Neural network inputs and outputs 


Inputs 


Gate length 
Gate voltage 
DC drain current 


Outputs 


Output power 
DC gate current 


Drain current 


(a) Draw a diagram of a chromosome that can be used by the SGA. 
(b) Is there any other way to simultaneously optimise both the weight values and 
the number of neurons in the network? 


(5) The scattering parameters for an HBT are given in Table 3.10. 


215 


Use the semi-analytical parameter extraction method to determine the ele- 
ment values in the equivalent circuit and then obtain the scattering parameters 
to compare with the given measured parameters. 


(6) Assume that the base—collector extrinsic capacitance could be lumped into the 
intrinsic Cpe and develop the strategy for the small-signal model parameter 
extraction procedure. 
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Table 3.10 HBT S-parameters 


Frequency (GHz) |Syi] -ZSyy_— (Saal ZSaa— Saal ZSy2—s [S22] ZS22 


0.45 
0.5 
0.6 
0.7 
0.8 
0.9 


(7) 


(8) 


(9) 


(10) 


0.955 —4.3 4.651 174.83 0.008 85.1 0.997  —2.78 
0.954 —4.83 4.648 174.25 0.009 84.2 0.996 —3.1 

0.953 —5.78 4.638 173.19 0.01 84.11 0.996 —3.73 
0.952 —6.71 4.631 172.13 0.012 83.44 0.995 —4.32 
0.951 —7.66 4.622 171.05 0.013 82.7 0.994 —4.93 
0.95 —8.68 4.614 170.05 0.015 82.79 0.993 —5.51 
0.948 —9.52 4.601 168.94 0.016 82.61 0.992 —6.12 
0.92 —18.95 4.457 158.29 0.034 75.73 0.974 —12.12 


0.833, —35.63 4.005 139.02 0.062 66.77 0.921 —22.78 
0.726 -—49.36 3.482 122.93 0.086 55.57 0.853 —31.24 
0.628  -—60.18 3.001 109.46 0.102 48.19 0.797 —38.1 

0.544 —68.72 2.593 98.35 0.115 42.08 0.75  —43.67 
0.476 —74.48 2.264 89.05 0.123 37 0.716 —48.35 
0.43 —79.5 2.009 81.24 0.132 34.11 0.694 —52.23 
0.39 —84.14 = 1.805 73.81 0.141 30.94 0.68  —56.36 
0.352 —87.56 1.633 66.91 0.149 27.43 0.669 —60.25 
0.326 —88.9 1.489 61.32 0.153 25.01 0.663 —63.83 
0.306 —89.49 1.382 55.44 0.158 22.24 0.661 —67.38 
0.299 —90.98 1.293 50.96 0.163 20.41 0.662 —70.79 
0.292 —94.44 1.227 46.22 0.167 19.91 0.672 —73.97 
0.283. —95.78 1.173 41.1 0.176 18.33 0.68 —77.75 
0.272 —98.22 1.127 35.42 0.184 16.84 0.69 —81.95 
0.2569 —99.7 1.073 29.92 0.194 14.21 0.688 —85.81 
0.246 —102.13 1.024 25.25 0.201 11.68 0.692 —89.2 

0.238 —100.66 0.974 21.13 0.206 8.92 0.697 —92.91 
0.232 —101.63 0.963 16.41 0.213 6.63 0.699 —96.4 

0.223 —101.26 0.918 11.77 0.22 3.11 0.694 —99.63 


What are the advantages of the inverted (also known as the emitter-down or 
collector-up) HBT? If it is promising in some power application, why is it not 
popular yet? Compare the emitter-up HBT and the emitter-down HBT in terms 
of the device parameters? 


Use the small-signal equivalent circuit of the collector-up HBT (Figure 3.50) to 
determine the basic expressions for the circuit parameters. 


The base—collector capacitance Cs is much smaller in the inverted HBT. Does 
that make the model parameter extraction easy? 


In compact device modelling, layout pads for measurement are not the part 

of the device which needs to be removed (De-embedding procedure) from the 

measurement data. In industry, open/short structures are used. 

(a) Design your own open/short structure. How do you think the pad size, ground 
layout, probe type or frequency affect your design? 

(b) Draw the equivalent circuit for the open structure. 

(c) Draw the equivalent circuit for the short structure. 

(d) The S-parameters for the open/short structures are given. What information 
can you extract from them? 


More free ebooks : http://fast-file.blogspot.com 


Fig. 3.51 


Fig. 3.52 


Optimisation and parameter extraction of circuit models 217 


(11) An engineer is developing the diode model at a particular bias. Outline the pro- 
cedure he/she is going to develop. Use the given S-parameter file. If there is 
inductance in the structure that he/she forgets to remove, how will that affect the 
final result? If the resistor and inductor are ignored, do you get the right result? 
Explain your answer. 

(12) There is a set of S-parameter files which were taken for the diode. Use the files to 
extract the diode capacitance model parameters, assuming that parasitic elements 
have been de-embedded out. 

(13) Given the HBT S-parameters measured at the cold condition, use what you 
learned to extract the model parameters. 

(14) Given the HBT S-parameters measured at the constant Vce condition, use what 
you learned to extract the model parameters. 

(15) Given the HBT S-parameters measured at constant J) condition, use what you 
learned to extract the model parameters. 

(16) The intrinsic part of the small-signal equivalent circuit of the MESFET which is 
similar to the hybrid pi model of the bipolar transistor is given in Figure 3.51. 
Outline the extraction procedure to extract the model parameters assuming 
negligible gas. Now the full equivalent circuit of the MESFET is given in 
Figure 3.52. Assume Lg = Lg =25 pH, Ls = 10 pH, Rs = Ra = 10 2, Rg =1 Q, 

Coa 
Go | | OD 
+ + + 
w TS 

Vgs J Im Vj C) lds "T Vas 

i. 

j 
SO : OSs 

C>> Cga 
Intrinsic MESFET circuit. 
Intrinsic device 
eerie ee 
L Rg i Coa Ra Lg 

G 999. ww T i AWW 5p— D 


MESFET equivalent circuit. 
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Cog = Cpa = 25 fF and gg, = 0S. Extract the other model parameters based on 
the given set of S-parameters. 


Note that the necessary data files are available on the Web labelled according to the 
problem numbers. 
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Executive summary 


All land line communication systems currently use optical fibres as the channel due to 
their very low attenuation, and deployed systems operate up to 40 Gigabits per second 
(Gbps), with experimental systems at 160 Gbps and higher. These high speed systems 
use very stable sources, usually the distributed feedback lasers, with external modu- 
lators and very fast detectors. Direct modulation of lasers leads to chirp, caused by 
the laser frequency varying when modulated, and therefore is not used in these sys- 
tems. However, interest remains in high speed lasers for several other applications. This 
chapter discusses high speed optoelectronic devices which include light-emitting diodes 
(LEDs), semiconductor lasers, photoconductors, p—i—n diode photodetectors, avalanche 
photodetectors, metal-semiconductor—metal photodetectors, travelling wave photode- 
tectors and briefly the phototransistor. The physics of the devices are outlined and then 
the parameters that make these devices fast are discussed with specific examples from 
the literature. 


Optical sources 


Two types of sources are widely used in optical communication systems, and they are 
the LED and the semiconductor laser. LEDs are inherently slow; their response times 
are determined by the lifetime of the carriers in the active region, and in most LEDs 
this is between 2ns and 10ns. Thus, small-signal response of LEDs is of the order of 
100 MHz, and the large-signal response is smaller than this. Lasers are complex devices, 
and the response time is determined by the so-called relaxation oscillation frequency, 
and this is due to the interaction between the photons which have a finite lifetime, which 
is a measure of the cavity quality factor, Q, the differential gain of the structure, the 
lifetime of the carriers and other parameters [1]. Most lasers, when directly modulated, 
suffer from spectral broadening, and therefore direct modulation is not used for long 
haul fibre-optic systems; but when this broadening is not important, direct modulation 
may be used. 

In this section, the fundamental aspects of LEDs and lasers are discussed, followed 
by an outline of the different types of lasers. To calculate the response of lasers, the rate 
equations are introduced and these are solved analytically for the relaxation frequency 
jr. A short discussion of the noise mechanisms of lasers follows. A further section is 
included on the very high f; lasers. 
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Preliminaries 


The generation of non-thermal light in a semiconductor requires the creation of hole— 
electron pairs which recombine radiatively to emit photons at the bandgap energy, or 
impurity level to valence band energy. The more recent source of light is the intra- 
band relaxation of electrons from a higher energy level to a lower energy level in the 
conduction band of a quantum well, to emit photons, and this has resulted in the quan- 
tum cascade lasers. In both these sources of light, a means of generation of the excess 
electron-hole pairs or excitation of the carriers to a higher energy state requires the 
expenditure of energy. If this energy is from an electrical source, then this is called elec- 
trical pumping leading to electroluminescent emission. Alternatively, optical pumping 
with photons of a higher energy than the emitted photons is also a possibility. In this 
chapter, only electroluminescent sources are considered, and quantum cascade lasers 
are not discussed. 


Light-emitting diodes 


The ubiquitous green and red LEDs are fabricated in GaP which is an indirect gap semi- 
conductor. The impurities of zinc oxide and nitrogen form isoelectronic bands which 
give rise to the red and green emissions, respectively. Other LEDs include the blue GaN 
device, the near infrared GaAs 870 nm device and the InGaAsP 1300 nm and 1550 nm 
devices. 

The mechanisms that create the LED are based on p-n junctions. A forward-biased 

p—n junction injects minority carriers to both sides of the junction, and these diffuse 
away from the junction to recombine radiatively and non-radiatively. The radiative 
recombination results in emission of photons, with energy hv approximately equal to 
the bandgap energy Eg. The recombination is spontaneous, which implies that emission 
has random phase, and the linewidth is of the order of a few kgT, where kg is Boltz- 
mann’s constant, and T is the temperature in kelvin, and the emission is incoherent. The 
internal quantum efficiency is a measure of how efficiently the injected carriers produce 
light, since some of them recombine non-radiatively. Define the total recombination rate 
Riotal, in recombination events per second, as the sum of the radiative recombination rate 
Ry, and the non-radiative recombination rate Rp;. Then, the internal quantum efficiency 
is given by the ratio of the radiative recombination rate to the total rate: 
= Rir _ Rr 
- Rtotal a Ry + Rig: 
However, lifetimes are more easily measured or inferred, and the quantum efficiency is 
recast in terms of the lifetimes. The radiative recombination rate is defined as the ratio 
of the non-equilibrium carrier density N to the radiative lifetime t,,, and similarly, the 
non-radiative recombination rate is the ratio of the carrier density N to the non-radiative 
lifetime Tp;, and thus Ry = N/t, and Ry, = N/Tpr. It follows that the total lifetime 
Ttotal IS given by 


Ni (4.1) 


= (4.2) 
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Hence, substituting in the Equation (4.1), the internal quantum efficiency is given by 
Tor 
; = ———_.. (4.3) 
™ Tor + Trr 


The total lifetime is obtained from the definition 


N 
Ror = Ry = Rtotal = , (4.4) 


Total 


and the total recombination rate Rtotal is given by 
Riotal = AnrN =F B,N?, (4.5) 


where Anr iS the non-radiative recombination coefficient and B, is the radiative 
recombination coefficient. It follows that 


Total = (Anr + ByN)1. (4.6) 


In direct gap semiconductors, the radiative lifetime is comparable to the non-radiative 
lifetime, whereas in indirect gap semiconductors, the non-radiative lifetime is much 
shorter than the radiative lifetime. Thus, the internal quantum efficiency is about 0.5 
for direct gap semiconductors and this improves very considerably for lasers to almost 
unity when the radiative lifetime becomes very small compared to the non-radiative life- 
time. In indirect gap semiconductors, silicon and germanium for example, the internal 
quantum efficiency is of the order of 10~>. Surface recombination is also a problem as 
this is non-radiative and decreases the internal quantum efficiency. To reduce this, a het- 
erojunction layer of a higher band gap may be deposited above the emission layer. The 
other problem is the absorption of the emitted photons in the generating layer, and there 
is little that can be done about this. The higher bandgap layer has very low absorption, 
and this heterolayer may help marginally. 

Consider the structure of the LED which is usually a p—n or p*—n or n*-p junction. 
It is necessary to have the junction close to the surface so that the emitted light is able to 
escape from the material into the air. Initially, consider the carrier injection process in 


a p—n junction when it is forward-biased. A p—n junction when forward-biased injects 
holes into the n region and electrons in the p region. With forward bias of V¢, it can be 
shown that the current density in the p—n junction is given by 


J=q (4 +4 wn) (et Me/ hat =i), (4.7) 
Lp Ly 
where Dp and Dy are the minority hole and minority electron diffusion constants in 
the n-type and p-type material respectively adjacent to the junction; Lp and Ly are 
the diffusion lengths of the minority holes and minority electrons in the n- and p-type 
material respectively adjacent to the junction; ppp and npo are the minority hole and 
electron densities in the n- and p-type material; g is the charge magnitude of an electron; 
kg is Boltzmann’s constant; T is the temperature in kelvin and Lpn = \/Tp,nDp,n. In 
some LEDs, the junction is n*—p, in which case ppg in the n* region is very small, as it 
is equal to n? / n*, and in many III-V materials nj, the intrinsic density, is very low, for 
example in GaAs it is about 10° cm~?. The hole injection term which is the first term in 
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Sketch of the n*+-p junction, with a very thin pt and the contacts of the LED. Also shown is the 
effect of total internal reflection in the escape of light from the p* layer. 


Equation (4.7) is small, and the injection is largely from the n* region into the p region. 
Thus, the injection efficiency, which is the ratio of the injected current into the p region 
to the total injected current is almost unity due to the fact that the injected current into 
the n* region is much smaller than that injected into the p region. This is the case for the 
p*—n junction as well, since the minority carrier density in the highly doped region is 
always much smaller than that in the lower doped region, and the injection efficiency is 
assumed to be unity. However, as indicated above, the internal quantum efficiency is of 
the order of 50% for direct gap semiconductors. As the junction temperature rises, the 
non-radiative recombination increases and so this figure is typical for room temperature 
devices. 

Assuming that the injection is primarily from the n* side of the nt—p junction into 
the p region. The minority electron distribution in the p layer is of the form Ane~<//", 
where the z direction is normal to the surface in Figure 4.1, and z is the distance from 
the junction towards the surface. This minority carrier distribution may be shown to be 
equivalent to npo(e4 V/keT _ 1)e~</Lp_ Thus, the radiative recombination is over this 
distribution, and for all practical purposes it can be shown that this is equivalent to an 
uniform distribution over the distance Ly. Assume that the thickness of the active layer 
is d, which is assumed to be much larger than the diffusion length Ly. Then, integrating 
this distribution 


foe) 
/ Ane~“/-ndz = AnLy, (4.8) 
0 


which shows that An may be considered uniformly distributed over Ly. The choice 
of the upper limit of 00 is because d >> Ly. The lifetimes in GaAs and InGaAsP vary 
from 2 ns to about 10 ns, and mobility of minority carriers of 1000 cm? (V.s)~! gives Ln 
values of less than 0.5 jm, and the active layer thickness may exceed this value. In the 
following sections, the tacit assumption is that the minority carrier density is uniform 
over Ly. 

In Figure 4.1, the top p layer has the injected minority carriers recombining to cre- 
ate the photon source. The spontaneous emission is isotropic and the trajectories of the 
photons at the surface to the air region result in total internal reflection when the inci- 
dence angle exceeds the critical angle. Thus, extraction efficiency of the emission needs 
to take into account this total internal reflection, when the incidence angle exceeds the 
critical angle given by 
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Geant (4.9) 
n2 
where ng is the index of air equal to | and nz is the index of the top layer of the LED 
through which the emitted photons escape. For normal incidence of the photons, the 
reflection coefficient I is given by 
_ No n2 


r= ‘ 4.10 
no +n2 ( ) 


The fractional transmitted power or the transmissivity (1 — ['7) for normal incidence, 
and is given by 
4non2 


eg ea 


(4.11) 
As the photon angle of incidence varies from the normal to 6c, the reflection coefficient 
changes, depending on the polarisation, which averages out between the two perpendic- 
ular and parallel cases for the spontaneous emission, and the transmissivity also changes 
[7]. The external quantum efficiency is obtained from 


1% 
Next = — ; T (0)2x sin@dé. (4.12) 
4a 0 


Since the expressions for the transmissivity vary with 6, and are difficult to inte- 
grate, T(@) is replaced by T(0). Substituting for T(0) from Equation (4.11) and using 
Equation (4.9), the external quantum efficiency becomes 


1 


nz(ng + nz)? —— 


Next = 
For a value of n2 of 3.5 and no of unity, Next is of the order of 1.4%, which suggests that 
most of the light generated is trapped inside the device. The presence of a heterojunction 
layer on the top surface, discussed below, complicates this expression as the index of 
this top layer is lower than that of the active layer. An anti-reflection coating on the 
surface helps to improve this factor considerably. The obvious method of extracting 
light from the p-layer is to place a hemispherical lens, index identical to the p-layer, on 
the surface. If there is a hetero-layer, then this needs to modified further. In the case of 
the p layer, the extraction efficiency becomes very high, but the problem is getting a 
suitable lens with the same index. 

The optical power emitted by the LED is given by 


I 
Popt = MintMext (AV) (4.14) 


where I/q is the number of electrons or holes that are injected into the active region 
per second; the internal quantum efficiency nin defines the fraction of them that recom- 
bine radiatively; and the external quantum efficiency 7ext the fraction of the generated 
photons that escape from the active layer. The total quantum efficiency of the LED is 
a measure of its performance and is the ratio of the output optical power to the input 
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electrical power, which is given as Pp = Vol, where Vo is the voltage drop across the 
device and the current is given by J. Thus, substituting from Equation (4.14) 


hv 


Mtotal = intext = - (4.15) 
qVo 


Neglecting contact resistance drops, hv ~ E, ~ qVo, where Eg is the band gap in 
eV, which then makes the total quantum efficiency, which is also the external power 
efficiency or the wall plug efficiency: 


Notal ~ NintMext- (4.16) 


This is usually less than a few percent, unless other techniques are used to extract the 
light more efficiently. Edge-emitting LEDs, discussed below, are therefore much more 
efficient. For visible LEDs the luminosity is also an issue [3], but this is not considered 
here. 

The responsivity R of the LED is defined as the ratio of the emitted optical power to 
the current, and substituting from Equation (4.14), is given by 


hv 
R= OG (4.17) 


This is of the order of 0.01 W A7! for the above values of 7 unless this becomes much 
larger. 

The spectral width of the emission Av of LEDs is approximately defined by kgT/qh 
in Hz and peaks at (Eg + kgT/2q)/h [1]. The full width half maximum (FWHM) is 
~1.8kgT/qh, and at room temperature (T = 300K) is about 11 THz. The spectral 
width in wavelength Ad varies as AA = Av(A*/c), and so varies from about 30 nm to 
90 nm. 


Modulation response 
The LED is effectively a n*—p junction, and therefore the usual diode current relation- 
ship holds, as discussed above: 


T = Ip(e4"/*87 — 1) (4.18) 


To calculate the modulation response, the small-signal behaviour is obtained by initially 
biasing the LED at some bias point, where the current is given by Jy. The carrier lifetime 
in the active layer determines the modulation rate of these LEDs. If N is the density 
of carriers and J is the current that flows into the LED, the following rate equation 
determines the carrier dynamics: 


dN I ON 
dt - qVol Ttotal 


(4.19) 


where Vol is the volume of the active region. Under steady-state conditions, the time 
dependence is zero and at a bias current of J}, then 
_ Ty Total 


Nb = ; 4.20 
b= Fvol (4.20) 
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where Nj is the carrier density at bias current /,. To obtain the small-signal response, 
let the current have an ac component given by Je/®™’, and similarly the carrier density 
also has a small-signal term Nme/2™'. Note that it is assumed that Im < Ip the bias 
current, and similarly Nm << Np. Thus 


I(t) = Ib + Imelo™ (4.21) 
N(t) = Np + Nme/2™. (4.22) 


Substituting in Equation (4.19) two component equations arise; the steady-state equa- 
tion is satisfied by the result in Equation (4.20). The time varying equation gives rise to 
the following solution 
TmTtotal 
(qVol) 
1+ j@mTtotal 


Nm(@m) = (4.23) 


The expression for optical power output given in Equation (4.14) varies as J/g, which 
is proportional to the carrier density. Thus, the modulated optical power Pm(@) varies 
as Ny». It follows that 


1 
1/2” 
[1 i (o2, teat] 


where Popt is the zero frequency steady-state output power. The frequency at which half 
power is obtained by setting the denominator of the above equation to 2, which leads to 


fraw = V3 : 


27 Ttotal 


Pm(@m) = Popt (4.24) 


(4.25) 


This equation confirms that the bandwidth is inversely proportional to the carrier 
lifetime. 


LED structures 

The basic planar LED is shown in Figure 4.2. In general, it is necessary to have a thin 
highly doped contact layer to reduce the contact resistance, and the emerging light has 
to pass through it. The absorption creates additional loss. If the active layer is intrinsic 
or undoped, with pt and n* layers on either side, then the injection into this layer is 
from both junctions. In practice, the intrinsic layer is always unintentionally doped as 
either p- orn, where one of the junctions is a p—n junction and the other is a high-low 
junction; but injection takes place from both junctions. 


A 


Contact metal 
layer 
aN as Pt layer 


Pp active layer 


n+ substrate 


Contact metal layer 


Sketch of the nt —p junction LED with a p* contact layer. 


More free ebooks : http://fast-file.blogspot.com 


228 


Fig. 4.3 


Fig. 4.4 


High-Speed Electronics and Optoelectronics 


Contact metal 
layer > n*-GaAs contact layer 
ee +_ 
fon AlGaAs layer 
—— GaAs active layer 


_—— p*-AlGaAs layer 


_—— p*-GaAs substrate 
Contact metal layer 


Schematic diagram of the heterojunction LED. 
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Lower contact 


Schematic diagram of the edge-emitting LED. 


A variation of this structure is to have higher bandgap heterostructure layers both 
above and below the active layer, appropriately doped. The diffusion of the injected 
electrons and holes injected from the active layer is blocked by the heterojunctions that 
are now formed on both sides of the active layer, as shown in Figure 4.3. These hetero- 
layers prevent the diffusion of the minority carriers to the surface, and therefore prevent 
surface recombination. Most LEDs designed at the current time use the heterolayers 
where available. 

A further variation on this is the placement of dielectric mirrors below the active layer 
to reflect the light emitted towards the substrate. A second variant is the edge-emitting 
LED shown in Figure 4.4, in which the heterolayers above and below the active layers 
together with the active layer act as a waveguide. A high reflectivity mirror at one facet 
makes this a superluminescent diode; with mirrors on both ends and with adequate gain, 
the LED may operate as a laser. The external quantum efficiency of the edge-emitting 
LED is very much higher than the surface-emitting devices because the transmissivity at 
the edge facet is of the order of 0.7 for the waveguide index ng of 3.5. With a high reflec- 
tivity facet coating at one end, the external quantum efficiency is also about 0.7 for ng 
of 3.5. Anti-reflection coating on transmitting facet would increase the transmissivity to 
almost unity, and the external quantum efficiency also becomes nearly unity. Thus, the 
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total quantum efficiency or the wall plug efficiency of edge-emitting LEDs is determined 
largely by nint, which is of the order of 0.5 in direct gap semiconductors. 

The normal emission of the surface-emitting LED is Lambertian, which implies that 
the emission intensity at angle 9 from the normal is cos @, which means that the beam 
width is 120°. For the edge-emitting LED, the emission is elliptic in form, and the beam 
width is about 30° in the horizontal plane but remains 120° in the vertical plane. 

LEDs are used in optical fibre communication systems that operate with multimode 
fibres, as the broad emission spectrum prevents modal noise being a problem [17]. A 
diode proposed by Burrus [4] has the fibre bonded into the face of the surface-emitting 
LED. 


Semiconductor lasers 


While LEDs utilise spontaneous emission for the emitted light, lasers operate on stimu- 
lated emission-generated light. Lasers are optical oscillators in which the gain medium 
is in a cavity; the light acquires gain in the medium between reflections from the ends of 
the cavity until steady state is reached, when the gain becomes saturated. The simplest 
version is a Fabry Perot cavity with the gain medium between two mirrors. 

In this section, semiconductor lasers are discussed, as they may be designed to be 
high-speed and high frequency lasers. While solid state, fibre and other types of lasers 
may produce extremely short pulses, known as ultrafast lasers, their repetition rates are 
typically 80 MHz, to a maximum of a few GHz, and are not considered ‘high-speed’ 
or ‘high frequency’ lasers, and therefore not discussed here. The semiconductor laser 
threshold condition is first considered, then waveguides used in these lasers are outlined, 
and different types of lasers are discussed. This is followed by the derivation of the rate 
equation for these lasers, and the solutions for various conditions. Discussion of noise in 
semiconductor lasers is also included. Subsequently, quantum well, quantum dot lasers 
and vertical cavity lasers are briefly discussed. 


Basic concepts 

When the gain medium is in a cavity formed by two mirrors, shown schematically in 
Figure 4.5, light in the form of an electromagnetic wave, electric field amplitude €, 
travels from one end of the cavity to the other end, where it is reflected by the end 


Mirrors 


Gain medium 


z=0 z=L 


Schematic diagram of gain medium in a cavity formed by two end mirrors. 
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mirror, and then propagates back, and is then reflected by the second mirror to return to 
its starting point. Suppose that the length of the cavity is L, this determines the distance 
travelled by the wave between reflections. Assume that the gain of the medium is given 
by g/2 Nepers per unit length, and hence the intensity gain is g per cm. Suppose the 
medium internal loss is aint /2 Nepers per unit length, or the intensity loss is a per cm, 
the medium index is n, and the reflection coefficients of the mirrors are r; and rz. For 
the structure to commence oscillation, the loop gain should be unity or larger, which 
implies that the field at the starting point €o has to undergo these reflections and is also 
subjected to gain to become after one round trip 


Eq = Eq(2L) = Ene J*hort ela— atin) L |p || ry. (4.26) 


The real part of this equation gives 


1 il 
& = Aint + —1 


—————e 4.27 
EL" Gril mie) 


The mirror reflectivity is defined as Rj = |r1|?, and similarly Ry = |r2|*. Substituting 
in the above equation, 


1 
& =n + — | = Qint + Omir- (4.28) 


21" (RiR2) 
The first term on the right-hand side is the medium loss without pumping and the second 
term is the mirror loss term, usually denoted by aj. For the gain medium of GaAs, the 
index is about 3.45, and the mirror is assumed to be formed by cleaved facets which 
result in plane mirrors, parallel to each other. The reflection coefficient of each facet 
mirror is 
3.45 — 1 


= ——— =0.55l. 4.29 
3.45 + 1 Sa 


r1.2 


The facet mirror reflectivity is 0.303, and hence the mirror loss term air is 1.194 em7!, 
The internal loss term is usually between 10 and 20cm7!, and therefore the gain needs 
to be 11.194-21.194cm7! for the round trip gain to be unity, and generally the gain 
needs to be higher than this value. 

In practice, only part of the wave obtains gain from the active region, and this is 
defined by the confinement factor I, which is discussed later. Thus, the oscillation 
condition for the laser in Equation (4.28) becomes 


lg = Qint + Omir. (4.30) 


Also the complex propagation constant in the gain medium now is given by 


B =nk — j (5) , (4.31) 
where 7 is the guide effective index and the loss term a is the net loss, and given by 


Q@ = Ain + Omir — Tg. (4.32) 
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mn 


Plane interface between medium 1, index n;, and medium 2, index n2, and the incident 
and transmitted light directions given by 6; and 62 respectively to the normal. 


The imaginary part of Equation (4.26) determines the phase requirement: 


mc 
2konL = 2m or Vy = ——, 4,33 

on m m (nL) ( ) 

where m is the longitudinal mode number, which may take values of 1, 2,3,..., but 


cannot be zero, as the solution is then trivial. Note that the mode spacing in frequency 
is given by c/(2nL). All longitudinal modes, with the values of m, satisfy this equation 
but only some of them are valid for a device. Since the medium gain is band-limited, the 
modes within this gain region are all excited at threshold when the gain is just larger than 
the losses. With increasing gain, generated by current pumping, the mode competition 
results in those close to the gain peak growing at the expense of the other modes. 


Optical waveguides in semiconductors lasers 

The light in the gain medium discussed above needs to be confined and guided, and this 
requires that the medium is in the form of an optical waveguide. An optical waveguide 
utilises the concept of total internal reflection that occurs when light emerges from a 
higher index medium to a lower index medium. In Figure 4.6, light in the form of an 
electromagnetic plane wave in a medium of index m2 is incident on the plane interface 
between the media, at an angle 92 to the normal. The second medium index is given by 
n1, With n2 > nj, and the light emerges into the second medium at an angle 62, obtained 
from Snell’s law: 


n2 sin 0 =n, sin al (4.34) 


Since nz > nj, it follows that 6; > 62. At the critical angle of 62, the value of 6; 
becomes 2/2, which implies that the emerging light travels along the interface. For inci- 
dent angles greater than 62¢, total internal reflection occurs, and with a second interface 
below the first, similar total internal reflection occurs so that the light remains confined 
to the high index region as shown in Figure 4.7. A similar confinement may occur in 
the plane normal to this to obtain two-dimensional guiding with appropriate layers to 
provide for an index guiding structure. 

The solution of the wave equation for this three-layer guide used in a laser results 
in both even and odd modes. The even mode is sketched in Figure 4.8. Semiconduc- 
tor lasers are classified as gain-guided lasers or index-guided lasers. In gain-guided 
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Slab waveguide which provides confinement in the transverse plane. 
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Three-layer slab symmetric guide, which is typical for laser structures: the upper layer is p-type, 
the guide layer of higher index, is generally undoped, and the lower layer is n-type. The guide 
layer thickness is d. The even mode field distribution is sketched . 


lasers, the mode confinement in the lateral direction, the horizontal direction, is not 
designed into the structure, and the mode is guided by the gain region of the structure. 
In index-guided lasers, the waveguide is well defined and the mode is confined in both 
the transverse direction, the vertical direction normal to the plane of the wafer and the 
lateral direction. A very popular structure among research scientists is the ridge laser, 
which is weakly index-guided. In all these lasers, the transverse confinement is obtained 
by the design of the heterolayers. Gain-guided, index-guided and weakly index-guided 
lasers are shown schematically in Figure 4.9. 

Optical waveguides are analysed using Maxwell’s equations, and the solution of the 
two-dimensional guide problem may be performed by a variety of methods. The results 
of this analysis enables the active region which acts as the guide to be designed. These 
guides are generally designed to be single mode in the x — y plane. The width of the 
guide is usually designated by w, its thickness by d and the length of the laser is L, 
leading to an active volume of Lwd. The current is assumed to flow in the contact over 
the length of laser and the width of w, which results in a current flow area of Lw. The 
analysis of the guide is generally performed, assuming that there is no loss or gain, and 
these terms are added as perturbations. The analysis determines the propagation con- 
stant of the guide £, and also its effective relative permittivity €;ef¢ and effective index 
neff. The phase and group velocities are different in optical guides, and the correspond- 
ing effective indices are also obtained. An important parameter is the confinement factor 
I. This defines the fraction of the power contained in the active guide region to the total 
power in the particular mode of the guide. 
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Schematic diagram of the gain-guided laser, the index-guided laser and the ridge laser which is a 
weakly index-guided structure. 


The effective index method is discussed next; it leads to approximate analytic expres- 
sions for the design of the guide layer. In this method, the guiding in the transverse, 
vertical, direction is analysed, and effective indices are calculated for each of the 
different transverse regions as they vary in the lateral direction. With the effective 
indices known for these transverse regions in the lateral, horizontal, direction, the one- 
dimensional guide in the lateral direction may be analysed, to obtain the solution of the 
entire guide. 

Maxwell’s equations lead to the Helmholtz’s equation, assuming that the cross- 
section of the waveguide remains constant in the z direction and the propagation 
constant does not vary with z. Thus, the wave equation is of the form: 


V7E + G(x, y)KRE = 0, (4.35) 


where €,(x, y) is the structure permittivity which varies with both x and y directions 
but does not vary with the z direction. The dielectric constant also varies with pumping, 
typically the increase in carriers results in a small decrease in index, and vice versa. The 
dielectric constant is complex due to absorption or gain, and these effects are added as 
perturbations. Ignoring these effects enables the modes of the guiding structure to be 
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obtained, and using the effective index approach allows both index-guided structures 
and gain-guided laser structures to be analysed. 

In the effective index approach, the assumption is that the solution may be separated 
into a y-varying, transverse varying, component in the form of a slab waveguide, and a 
similar effective index varying guide in the x direction. Assume that the solution of the 
Equation (4.35) obtained by the separation of variables is of the form: 


E =ak(y; x)W(xye IP, (4.36) 


where f is the propagation constant and a is unit vector in the direction of the €-field, 
€, which defines the mode polarisation. Substituting in Helmholtz’s Equation (4.35), 


lady 1d 7, 2 
F dt + Fa t Mats») - #7] =0. (4.37) 
where ko = @,/(€o/40), which defines the propagation constant in free space. 

The next step is to solve the transverse, y directed, field distribution and with it the 
effective propagation constant Berr(x) for a fixed value of x. Using the transverse part 
of the Equation (4.37) 


a 
ee 


[Hea y)- Ban(x)] & =0. (4.38) 


The lateral, x directed, field distribution and the propagation constant 6 are then 
obtained from the equation: 


dy 

dx? 
Consider the transverse modes of a typical laser structure, which may have as many as 
four or five layers in the slab guide form. However, the principle of the design is obtained 
from the three-layer guide shown in Figure 4.8. The solution of Equation (4.38) for the 
even TE mode (€ is directed along the x, lateral, direction) is of the form: 


+ [62 - p°| py =0. (4.39) 


& = Aecos(ky) for |y| < d/2 (4.40) 
= BeeWVI-4/2) for |y| > d/2, (4.41) 
where 
Kk = ko (n3 = ng) (4.42) 
y = ko (nq — ny” (4.43) 


Note that m; and n2 are the refractive indices of the cladding and the guide layer 
respectively, and nz > n, for guiding. 

In the TE mode case, the only components of field present are Ex, Hy and Hz. The 
boundary conditions require the continuity of € and d&/dy at |y| = d/2, and these cor- 
respond to the continuity of the €, component across the interfaces, and the continuity 
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of the 7, component across the interfaces, respectively. This leads to the following two 
equations: 


kd 
Ae cos) =B, (4.44) 
and 
._ (Kd 
K Ae sin( ) = Be. (4.45) 
Dividing the above two equations, 
kd 
K tan( “) — (4.46) 


The solution of this equation gives the values of Bere from which the effective index is 
obtained through the relationship Ber = konege. 
For the odd TE modes, it may be shown that the dispersion relationship becomes 


—K cot(“) =y. (4.47) 


This is obtained by setting the initial solution for € in Equation (4.40) as sin(x y) instead 
of cos(ky). 

For the TM modes, the € vector is along the y direction, normal to the interfaces 
in Figure 4.8. In this case, the only components of fields present are €y, 7x and €;. 
From the Maxwell curl equations, €; is proportional to d€y/dy. Thus, the boundary 
conditions are the continuity of the y component of electric flux across the interfaces 
and the continuity of €, across the interfaces. These lead to the following equations for 
the even TM modes: 


d 
neAg cos(S) = n?B, (4.48) 


and 
_ [Kd 
KAe sin( ) = VBae. (4.49) 


Dividing the above two equations gives the dispersion relationship: 
2 kd 2 
Knj tan aE = ny. (4.50) 
For the odd TM mode, the dispersion relationship is given by 
2 Kd 2 
— Kn} cot a =nyjy. (4.51) 


Consider the TE mode solutions, squaring Equations (4.42) and (4.43), and adding 
results in 


e+y=k (n3 = n;) (4.52) 


This is the equation of a circle in the «—y plane, and the intersection of the circle with 
the curves defined by the Equations (4.46) and (4.47) provides the modal solutions for 
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these equations. Note that multiple solutions are likely to occur as both tan and cot are 
periodic functions, and also depending on the parameters of the guide, defined by 7, 
n2, d and the wavelength which defines ko. 

At the cutoff of the guide which implies that the guide is no longer guiding, then 
y = 0. Note that y may not become negative, as it would imply exponential growth 
in the cladding region which is non-physical, and therefore the smallest value y takes 
zero. When y = 0, then from Equations (4.46) and (4.47) 


kd = pr, (4.53) 


where p is an integer whose even and odd values satisfy Equations (4.46) and (4.47) 
respectively, corresponding to the even and odd modes. Now let 


D= pr =kd. (4.54) 


Now for y = 0, Equation (4.52) becomes 
0.5 
D=%% (n3 = ni) d, (4.55) 


where D is the normalized guide layer thickness. For a single transverse TE mode 
guide, this requires D < z. The layer thickness d for a single transverse mode guide is 
obtained as follows. Setting kg = 27/A in the above Equations gives 


d < ; (x3 — ni) —_ (4.56) 


For GaAs/AIGaAs at 870 nm wavelength, with indices of 3.45 and 3.41, this suggests 
the guide thickness of less than 830nm. The usual thickness is of the order of 0.2— 
0.5 wm. If the AlGaAs layer has a higher value of index, closer to that of GaAs, then the 
thickness of this layer may be larger. For InGaAsP lasers [2], the layer thickness is also 
of the order of 0.2 |1m, and in this case the layer thickness needs to be less than 0.48 jum. 
According to [2], this relationship holds for lasers in the wavelength range 1.1—1.65 jm. 
The confinement factor in the transverse direction I‘7 is a measure of how much of the 
mode power lies in the guide active region, and is calculated from the equation: 


d/2 
an &)dy 
y= 


fe E2(y)dy ” 


Performing the integration and simplifying, [2] derives this as 


(4.57) 


D2 
= 24 Dy 


The effective index of the guide in the transverse direction is given as [2]: 


ly (4.58) 


ee = ni +Tr7 (n3 _ ni) ‘ (4.59) 


The lateral modes are next evaluated. The loss of laser structures may be as high as 
5-10Npcm!, and therefore in principle these losses need to be taken into account. 
However, when pumped, the gain which is typically of the order of 50 Npcm7! 
larger, allows the guide to become transparent, which implies that the loss is cancelled 


and 
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and neff? vertically, with nefp2 > Neff) for the index-guided laser and the weakly guiding ridge 
laser. 


by the gain. However, if the losses need to be accounted for, this is usually performed by 
the perturbation technique. Thus, the lateral modes are calculated in a similar manner 
as the transverse modes, assuming the guide is lossless. In this case, the wave equation 
to be solved is given by Equation (4.39) and repeated here for convenience: 


aw 


oat [Péin(x) = "| row (4.60) 


Note that Bere(x) = konete(x) which is obtained from Equation (4.59), and thus the prop- 
agation constant of the total guide 6 is obtained. In the case of the index-guided laser in 
Figure 4.9, the transverse effective index in the main guide region has the largest effec- 
tive index, and the regions outside have lower effective indices. As shown in Figure 4.10, 
the symmetric slab guide in the vertical direction with the effective indices calculated 
from the transverse slab guide equation. Here, nefp2 > Nerf) So that the guide acts with 
the centre region as the larger index. 

In this case, the TE mode from the transverse slab guide becomes a TM mode and the 
TM mode becomes a TE mode. Based on the earlier approach, the width of the central 
region is set as w, and the normalised width as W. Then, it follows that 
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0.5 
W=ko(ndm— nan) w (4.61) 
and 
W =qn. (4.62) 
For the lowest order mode 
A (2 ye 
w< 5 ( - nes) i (4.63) 


In this case, the effective indices take different values, but this provides an indication to 
the width of the guide which usually is in the 3 zm region. 
The lateral confinement factor I, following the earlier derivation, is given by 
w2 
lL = ——_—.. 4.64 
is Q+W) (4.64) 


The waveguide mode refractive index is now given by 
ere = Meee + UL Mete — Nese): (4.65) 
The laser confinement factor T’ is the product of 'y and Ty and is given by 
Te Trn, (4.66) 


The two most important parameters that are obtained from this analysis are the effec- 
tive index of the guide and the confinement factor. From the effective index, the guide 
propagation constant may be obtained. 

These results are for the index-guided structures, and the same technique may be 
used for the weakly guiding ridge structure in Figure 4.9. The analysis of the gain- 
guided laser is much more complex, and interested readers are referred to the paper by 
Nash [24]. 


Emission characteristics 

The waveguide which guides the light has facet mirrors on both sides of the device. 
These may be coated to obtain high reflectivity or to reduce the reflectivity as desired, 
but generally these facets are uncoated, in which case the reflectivities are equal. 

With current pumping, the light intensity against input current is plotted in the L-I 
characteristic as shown schematically in Figure 4.11. Note the different regions of this 
curve: the pre-threshold where the laser output is from spontaneous emission, the thresh- 
old current at which the device starts to lase, which is the linear region, followed by the 
saturation region due to gain saturation. The current into the laser is of the form: 


Tf owiJ, (4.67) 


where w is the width of the active region and L is the cavity length. In practice, index- 
guided lasers in Figure 4.9 have leakage current through the various reverse-biased 
junctions, and should be added to this current expression, but this is ignored in the 
present discussion. 
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Schematic diagram of the light intensity against current drive, the LI characteristic. 


Assume that the gain is given approximately by the expression 
g =a(N — No), (4.68) 


where a is the gain coefficient, equal to (dg/0N), N is the carrier density, and No is 
the carrier density at which transparency is obtained, when population inversion occurs. 
The threshold carrier density is defined when the product of the confinement factor and 
the gain is equal to the loss, or 


Pa(Nin — No) = Qmir + Qint, (4.69) 


or 
Nth = No + (Qmir + Aint) /(al). (4.70) 


The number of carriers pumped into the active region per second is I/q. However, 
since the equations are in carrier density, the number of carriers injected per second 
per unit volume is [/(qVol) = I/(qwLd) = J/qd. The loss of the carrier density is 
through recombination at the rate R(N), which also includes the stimulated emission 
recombination, and this is defined as Riotal. Thus, the rate equation for the carriers is 
given by 

dN J J N 

= = 77 Row = T= - = RstimN ph, (4.71) 

dt qd qd Te 
where Te is the carrier lifetime and the last term is the stimulated emission recombi- 
nation. At and below threshold, this last term may be omitted as the contribution from 
stimulated emission is small. At steady state, the time variation is zero, and hence 


J = qd Roa. (4.72) 
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The stimulated emission recombination rate is RstimNph, where Npn is the photon 
density and Rgim is defined by 


Cc 
Rstim = 78): (4.73) 


where g(J) is the gain defined in Equation (4.68), c/n is the group velocity. 
From Equations (4.70) and (4.71), the threshold current density is obtained as 


dN 
Gas (4.74) 
Te (Min) 
where, neglecting stimulated emission recombination close to threshold, 
1 
= (Ap + ByN + CN*), 4.75 
Z(N) (Anr + BN + ) (4.75) 


where Apr is non-radiative recombination coefficient, B, is the radiative recombina- 
tion coefficient and C is the Auger non-radiative recombination coefficient, which is 
important for long wavelength lasers. 

When the laser operates beyond threshold, the carrier density is clamped at the 
threshold value, and further injection results in conversion into photons by stimulated 
emission. Thus, Equation (4.72) may be written as 


Nth 
J = qdRoota = qd— + q4 Rstim Nph- (4.76) 
e 


Substituting from Equation (4.73), and replacing g by @int + Omir at threshold, 
J— Jn = qdvg (int + Omir) Nph- (4.77) 


The photon density in the cavity depends on the carriers injected into the cavity and the 
quantum efficiency of the material. However, the photons in the cavity travel at the group 
velocity of the medium vg, and are either absorbed due to internal losses or escape from 
the facets. Thus, the photons have a finite lifetime in the cavity, and the photon lifetime 
Tp is given by 

1 


t= ——_——__ (4.78) 
Ug (Qmir + Cint) 


The Equation (4.77) becomes 


Nph 
J — Jn = qd— (4.79) 
Tp 

The relationship between the injected carrier density per second and the photon density 
in the cavity is now obtained. The carriers recombine at a rate defined by the carrier 
lifetime, to produce photons, and this is accounted for by the quantum efficiency of the 
material from Equation (4.3), and the photon density also decays at the photon lifetime. 
Thus, 
(J = Jin) _ Noh 


4.80 
a = (4.80) 


Nint 
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or 
(J — Jin) 


qd 
This equation shows that the photon density in the cavity increases linearly with current 
density above the threshold current density. 

The output power per facet is obtained by the product of energy of each photon hv, the 
photon loss per facet vg@mir/2, the active laser volume Vol and the photon density Npn: 


Noh = NintTp (4.81) 


1 
Pracet = 5b vgomir VolN pn (4.82) 
where Vol = Lwd. Substituting for Non 
hv Omir 
Pracet = Mint Lw(J — Jin). (4.83) 


2q Omir + int 
Since J = Lw J, this equation becomes 
h 


v 
Pracet = Mint 
2q Amir + Aint 


Omir 


(I — Inn). (4.84) 


Note that the threshold current is obtained from Equation (4.74) 
qLwdNin 


Te 


Ih = (4.85) 


The total output power Pout = 2 Pracet and the differential (external) quantum efficiency 


is given by 
q ( i) 
hv a Omir 


a awe — In) - EE a + int 


(4.86) 


This is proportional to the slope of the L-I curve. Note that in the laser the nint needs 
to take into account the stimulated emission rate. Then, the total recombination rate 
becomes 


Reotal = AncN + BpN? + CN? + RetimNoph- (4.87) 


The coefficients have been identified in Equation (4.75) except for Rstim, which is 
the stimulated emission rate and given in equation (4.73) Thus, the internal quantum 
efficiency now is given by 

Riad B,N? + Rstim Nph 


int = = (4.88) 
Tint "Rout Au + BN? + CN? + ReaaNoh 


Above threshold, the Rstim pn dominates and is much larger than the other terms, which 
makes nint almost unity. 

The power efficiency of the laser is given by 
Pout hv Omir (I — In) 
VI qgV (Qmir + Qint) = 


p= ; (4.89) 


where V is the applied bias in volts. 
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The L-I curves for a buried heterostructure laser at different temperatures. The inset plots the 
threshold current against temperature to obtain To ( R. J. Nelson, R. B. Wilson, P. D. Wright, 
P. A. Barnes, N. K. Dutta, JEEE Journal of Quantum Electronics, Vol. 17, No. 2, pp. 202—207, 
1981. ©1981 IEEE). 


The temperature dependence of the threshold current density varies as 
Jn = Jenoe™/™. (4.90) 


A typical example of the variation of the L—I curves for different temperatures is shown 
in Figure 4.12 [25], and the inset in this figure is also the plot of threshold current with 
temperature to obtain 7p. 

This figure shows that as the temperature rises, the threshold current increases as the 
recombination becomes increasingly non-radiative. 

The edge-emitting laser has a large number of longitudinal modes within the gain 
region, and at threshold all these are excited. Figure 4.13 shows these modes, and at 
threshold only those modes that have enough gain to neutralise the loss finally emerge. 
In this figure, two modes are shown to have this property, and in general it is possible 
that only one of these modes is likely to be dominant and the other becomes a secondary 
mode as the current and hence the gain is increased. 

In Figure 4.14, the spectrum of the emission is plotted relative to the current excitation 
along the L-I curve, and this is also from [25]. 

The spectrum narrows from several modes close to threshold to a dominant mode 
with a few subsidiary modes at higher drive currents. The ratio of the intensity of the 
dominant mode to the next highest mode expressed in dBs is a measure of the mode 
suppression ratio (MSR). Since there is no guarantee that the Fabry—Perot laser with 
facet mirrors will produce a device with a single dominant mode with a large MSR of 
at least 20 dB, other techniques for mode selection, such as gratings in the active region 
or outside, may be used. 
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Fig. 4.13 The longitudinal modes of the laser within the gain region of the laser, showing the modes that 
are likely to lase when the gain is equal to the loss, including the mirror loss. 
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Fig. 4.14 The L-I curve of a buried heterostructure laser, together with the mode spectrum at different 


current levels of excitation (R. J. Nelson, R. B. Wilson, P. D. Wright, P. A. Barnes, N. K. Dutta, 
IEEE Journal of Quantum Electronics, Vol. 17, No. 2, pp. 202-207, 1981. ©1981 IEEE). 
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The laser illuminates the facets and these fields determine the near field pattern of 
the laser. It is usual to approximate the fields as a Gaussian distribution in the trans- 
verse direction, and a similar Gaussian distribution in the lateral direction (see [1]). The 
product of these Gaussians give rise to an elliptic distribution of the E-field on the facet. 
The far-field beam pattern is obtained using the usual methods by the spatial Fourier 
transform of the near field pattern [1]. 


Calculation of absorption, emission and gain 

The calculation of the absorption, emission and gain in a semiconductor is a complex 
process, and will not be given here, as the method has been discussed in several text- 
books, for example in [8]. Figure 4.15 shows the calculated gain/absorption spectra of 
InGaAsP for different levels of carrier injection, with the gain peak shifting as the carrier 
density increases [9]. 


Rate equations 

Since lasers have a complex relationship between the injected carriers and the photons 
generated, the calculation of the dynamic response is more involved than that in LEDs. 
Essentially, the laser operates as an oscillator with a cavity, which is many wavelengths 
long, except in specific cases. Therefore, the cavity has many resonances corresponding 
to the expressions in Equation (4.33) in which the resonance frequencies are given by 
Vm = mc/2nL, for different values of m. The corresponding radial frequency for the 
mth mode is assumed to be Qy = 27Vm, and the corresponding wave number k is 
given by 

NQm mit 


kn = =; 4.91 
m . 7 (4.91) 


For the present, the subscript m is omitted for convenience. The laser radial frequency w 
is undetermined, but nearly coincides with the cavity radial frequency &2. The effective 
permittivity €;ef¢ is defined by the effective index of the laser guide, but the perturbation 
of gain and loss has to be added. However, the use of the effective permittivity or index 
reduces the wave equation to the one-dimensional form. 

The injection of carriers into the laser causes a small change in index of the various 
layers, and for the active layer this takes the form: 


An = DN, (4.92) 


where b is equal to dn/0N, and has a small negative value. Define 


2kob oS 
Be = ——— = —2kg (#) (4.93) 


a 


which results in 6. « b, but is dimensionless, and also positive. 6, has been described 
as the anti-guiding parameter, or the linewidth enhancement factor. 
The effective index is now given by 


Neff =n+TAn. (4.94) 
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The absorption/gain spectra for different levels of carrier injection. Reprinted with permission 
from N. K. Dutta, Journal of Applied Physics, Vol. 51, pp. 6095-6100, 1980. ©1980, American 
Institute of Physics. 


This assumes that An is mainly in the active guide due to carriers, which is strictly not 
correct, as the carriers have to be injected from the top and bottom contacts. However, 
both the electrons and the holes end up in the active region to recombine and so this is 
an acceptable approximation. This assumption also simplifies equations to be derived 
below. 
Also, the effective permittivity is given by 
2 ,na 2 jna 
€reft = (1 + TAn)* — Ie xn + 20nAn — — (4.95) 


The one-dimensional wave equation applied to a cavity requires a solution in the form 
sinkmz, with km defined in Equation (4.91), neglecting the effects of the end mirrors. 
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The electric field wave equation for the cavity is 


@E -efp d2E 
al (4.96) 


dz? c2 dt? 


and the solution is of the form: 


E(z,t) = Do sin(kjz)A(e!, (4.97) 


where A(t) is assumed to be slowly varying in time compared to the light wave fre- 
quency. Assuming this is a single longitudinal mode laser, the summation and the 
subscripts i and j are dropped. Substituting this in the wave Equation (4.96) and 
neglecting the second time derivative of A 


2 . 
ro) 2jo dA 
—RA+ az sre — Fret = 0. (4.98) 
Simplifying with k* ~ Q?n?/c?: 
2 22 . 
@) Q¢n 2jw ,dA 
(3 €retf a A 2 n oe 0. (4.99) 


However, since the laser frequency @ is very close to the cavity frequency Q, then 
(w* — 2?) & 2a(w — Q). With further simplifications, substituting for €;er¢ from 
Equation (4.95), this equation becomes 


dA n jo a 
=—j Q TA j 4.1 
a Ihe (@ )A ig ( n inn) A, (4.100) 


where 7g is the group index. 
Separate this equation into its real and imaginary parts by substituting 


A= Ael? (4.101) 
into (4.100) to obtain for the real and imaginary parts: 
dA a 1 
it Oe A= 5 ells Omir — Mint) A (4.102) 
and 
d 
P poy ae. (4.103) 
dt Ng 


The rate equations may be written in terms of the photon density Npp and carrier density 
N, or in terms of total number of photons and total number of carriers in the laser active 
volume, and both these approaches have been used in the literature. In the following 
derivations, the total number of photons and carriers in the laser volume are used. The 
total number of photons in the cavity is obtained from the equation: 

2 


Ss i Vol E7dV, (4.104) 
hv 
and since A* « S, then multiplying Equation (4.102) by A, this equation becomes 
dS 
ros (G — Yp)S + Rop, (4.105) 
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where 


G =Tvgg (4.106) 


is the normalised gain, which is the stimulated emission rate. The photon decay rate is 
given by 


1 
Yp = Ug (int + Omir) = = (4.107) 
P 


The term Rs, has been added to account for the spontaneous emission in the lasing 
process. The spontaneous emission takes place over the whole laser cavity but only a 
small fraction Bsp) couples into the waveguide mode, which is the integral part of the 
laser. Rsp may be written as 


Rsp = BsptspYeMt, (4.108) 
where 


Ye = (Aur + BN + CN?) = = (4.109) 
e 
and sp is the spontaneous emission factor, which is usually a fitting parameter with 
values of 10~4-10-°, according to Agrawal [2]. The term 7.) = B,N/ye is the internal 
quatum efficiency, and is the fraction of carriers that recombine to emit photons through 
spontaneous emission. 
N, is the total number of carriers in the active volume obtained by integrating the 
carrier density N over the volume: 


m= | wav. (4.110) 


The phase Equation (4.103) needs further simplification. From Equations (4.92) and 
(4.93) 


Be Be 
Me NX Ag: 4111 
ion oa ha en 


The last term in Equation (4.103) becomes 


1 
@TAn = —=BTugdg. (4.112) 
Ng 2 


Now AG = TugAg, and replacing AG by G — yp, Equation (4.103) becomes 


1 
= —-(@ tn) + 5 Be(G Yp): (4.113) 


The first term on the right-hand side in the above equation has been expanded in terms 
of the threshold frequency, and at threshold the cavity frequency Q is very close to the 


threshold frequency. 
The carrier rate equation given above in Equation (4.71) is 
oem! a RgimN, (4.114) 
dt = qd Te stim/Vph- : 
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The expression for Rgtim is given by 
c 
Rstim = 7 8) = ugg (N). (4.115) 
& 


Equation (4.75) defines the carrier lifetime t.. Since the volumetric values of N and Non 
are used in the photonic rate equations, integrating this equation over the active volume 
results in 


I 
= veM — GS, (4.116) 
q 


where ve = 1|/Te. The confinement factor is introduced in the last term to convert the 
expression to G, and the total photon number S is obtained from the integration of the 
photon density Npn. 

In practice, Fabry—Perot lasers with facet mirrors are longitudinally multimode, and 
therefore the rate equations apply to each individual mode, and have to be solved 
simultaneously. Thus, the rate equations become 


Sin = (Gn — ¥p) Sin + Rep(@m) (4.117) 


a, 
N= = eM — ) GSm. (4.118) 
q m 


The rate equations developed above allow the calculation of the L—TJ curve, the longi- 
tudinal mode spectrum and the MSR, among other properties of the laser, provided the 
parameters are known. The dynamic behaviour which includes the turn-on delay, the 
modal behaviour, small- and large-signal modulation may also be calculated from these 
equations. The calculation of noise requires the addition of the noise sources to these 
equations, and these may also be obtained. In the following section, some aspects of the 
calculation methods for the steady-state and dynamic behaviour are discussed. 


Steady-state and dynamic characteristics 
Under steady-state conditions, for the single longitudinal mode laser in Equation 
(4.105), the time variation is zero. This makes this equation: 


(G — Yp)S + Rep(w) = 0, (4.119) 


which becomes 


, (4.120) 


which states that the spontaneous emission photons in the cavity are created by the 
injected current. When the net stimulation emission rate G is nearly equal to the photon 
decay rate yp, then threshold is reached. The value of G is a little below the decay rate at 
threshold, and as the gain increases, G is asymptotic to 7p, but the denominator should 
always remain positive. 
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Power output against current (G. P. Agrawal and N. K. Dutta, Semiconductor Lasers, Van 
Norstrand Rheinhold, 1993. ©Springer). With kind permission of Springer Science and 
Business Media. 


Substituting in the carrier rate Equation (4.116) 


oe N+R a (4.121) 
no t er cs . 


This equation may be used to calculate the light intensity output against current, the L—-I 
curve. Lee [19] has discussed the method of solution, as this is a non-linear equation. 
Figure 4.16 shows the results of the calculation of output power against current from 
[2] for the particular laser that has been modelled. 

The time evolution of the build up in the carrier levels and the photon levels are 
shown in Figure 4.17 from Marcuse’s paper [21] for a multimode laser when the cur- 
rent has been increased as a step function from 0 to 1.5 Jn. The rate equations are 
solved by numerical calculations outlined in [21], and the results are shown in the 
Figure 4.17. 

The time delay of the build up to stimulated emission is typically of the order of 3 ns, 
and this is the reason why lasers are biased just below threshold current when they are 
pulsed on and off. The output power at this bias level is extremely low and so for most 
purposes may be regarded as negligible. Note the oscillations in the light power output 
at the relaxation oscillation frequency of the laser. This relaxation oscillation frequency 
jf; determines the maximum small-signal response of the laser, and this may be derived 
from the above rate equations. Also it may be shown that the relative intensity noise 
of the laser also peaks at the relaxation oscillation frequency, and thus modulation at 
or near this frequency needs to be avoided to improve the signal-to-noise ratio of the 
detected signal. 
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Transient response for a laser in which the current is increased as a step function from 0 to 
1.5 It), showing the carrier density and power in the fundamental mode and other modes 
(D. Marcuse, T. P. Lee, IEEE Journal of Quantum Electronics Vol. QE-19, No. 9, 

pp. 1397-1406, 1983. ©1983 IEEE). 


This figure shows that the oscillations are damped and in the form of e~@r4i%)!, 


where I’, is the decay rate and , is the frequency of oscillation, which is the radial 


relaxation oscillation frequency in radians s~!. 


To calculate the relaxation oscillation frequency, the rate Equations (4.105) and 
(4.116), repeated here for convenience, are used. 


S = (G—Y)S + Rep (4.122) 
: I 
N= — — veM — GS. (4.123) 
q 
Consider perturbations to S and NM; given by 6S and dN. Additionally, 
G(M, S)=G+ Gy,dM+ GsdS, (4.124) 


where Gy, = 0G/dM, and Gs = 0G/0S. Substituting in the rate equations, separating 
the perturbed terms, results in 


: IRsp 
68 = —I's6S + (Gy,S+ 5M (4.125) 
aM 
5N, = Tn,d5M — (G + GsS)dS, (4.126) 
where 
Rs 
P's = —* — GsS, (4.127) 
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and 


OVe 
OM 


Py, = ve + M ( ) + Gy,S. (4.128) 


To solve these equations, the assumption is that these perturbations decay at the rate 
shown in the time evolution result in Figure 4.16. 
Thus, let 
5S = dSpe7™ (4.129) 
dN, = SNe", (4.130) 


where 6 Sp and 6 No are the initial values of the perturbations, and 
h=T,+ jQy. (4.131) 


The real part is the decay rate 


1 
r= 70m + Ps), (4.132) 


and the radial relaxation oscillation frequency, after some approximations, is given by 


Q, © (GGy,S)!”. (4.133) 
With further substitutions for 
I-I 
5 U-') (4.134) 
(qG) 
Ith = Ve Nt-th (4.135) 
ége = (4.136) 
Ne Vol” 
the expression for the relaxation frequency becomes 
1+TvgaNotp ( I Me 
Gus ee tol ( 1)] (4.137) 
TpTe Th 


where No is the transparency carrier density. Derivation of the small-signal response 
shows that Q, is the key parameter for high-speed lasers; the larger this is, the higher 
the laser response frequency. The term vga NoTp evaluates to a number close to unity, 
and therefore may not be neglected. However, making tp smaller increases the value of 
Q;. This reduction in tp may be obtained by decreasing the facet mirror reflectivities 
by coatings, or alternatively by making the laser guide more lossy. The parameter Te 
which is the carrier recombination rate is initially determined by the material. After 
threshold this recombination rate becomes shorter, and little can be done to reduce it 
further. Reducing Jj, and increasing the ratio J/ Jj, also increases the magnitude of Q,. 

The small-signal modulation response calculation uses the rate equations, but in this 
case the phase Equation (4.103) is also featured. The derivation assumes that the laser 
is biased above threshold at some current J}, and the small-signal current modulation 
term, modulation at @m, is given by J sin(@mt). The expression for the total current is 
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I(t) = Ih + Im sin(@mf), (4.138) 


with the assumption that Im < (ph — Jin), which implies small-signal modulation. 
The total cavity photons may be written as 


S(t) = Sp + Sm Sin(@mt + Om), (4.139) 


where Sp is the cavity photons at the bias current of J, and the second term is the 
sinusoidal time varying component, with a phase term which usually lags the current 
component. The carriers in the cavity have a similar expression given by 


Ni (t) = M—b + M—m sin(@mt + &m), (4.140) 


with the phase delay &,,, which differs from the photon number delay. 

Substituting in the rate equations, the solutions for the various small-signal terms 
may be obtained. The small-signal photons in the cavity, which is also a measure of the 
output power, may be shown to be 

Gy, SIm/9 


Sm = 4.141 
” [(@2, — 2 — T2)2 + 402,72]95 oe 


and the phase lag term for the photons is given by 


= 21am 
6m = tan7! (£5) (4.142) 
, wr, _ Q2 _ c? 


The modulated cavity photons or equivalently the modulated output light power is 
almost constant when am « &,, then peaks to a maximum near Q,, and falls off for 
@m > Q;. Note that the laser is a forward-biased p—i—n diode, and therefore additional 
series resistance due to contact resistance and parasitic inductance and capacitance from 
the bond wire and pads results in further degradation of the response. This is shown 
[33] in Figure 4.18. It may be shown that the 3 dB bandwidth is given by [1], assuming 
lr < Q;: 


fsap = = (4.143) 


When the laser current is modulated, the increase and decrease in the current result in 
the laser effective index varying inversely as the current according to Equation (4.92). 
Thus, the longitudinal mode cavity changes its electrical length since the index changes 
with current injection. As the modulation takes place the frequency of emission of 
the laser keeps changing in synchronism with this modulation. Using the phase rate 
equation, it may be shown that the change in frequency results in a frequency chirp 
given by [1]: 


1 
dv(t) = —— = 


o-#| 
In dt 4n 


1 
Gy,.(Mt — Mo) — =| : (4.144) 
Tp 
where f, is the linewidth broadening factor of Equation (4.93) and 


Nt-0 = f mav. (4.145) 


More free ebooks : http://fast-file.blogspot.com 


Fig. 4.18 


Optoelectronics 253 


10 - 
—— MODEL 
= @ ce x MEASURED DATA 
3 
wu 0 
ep) 
Zz 
9S PARASITICS 
gy 5 
lw 
a 2 
s x |)=80mA 
O -10}- 
z 
5 
2 -15L 
fo} 
= 
~20- 
25 
0 1 2 3 4 5 6 7 


FREQUENCY (GHz) 


Modulation response for different current drives, showing the effect of parasitics, from [33] with 
permission (R. S. Tucker and I. P. Kaminow, Journal of Lightwave Technology, Vol. 2, No. 4, 
pp. 385-393, 1984. ©1984 IEEE, OSA). 


This linewidth broadening that occurs with chirp is unacceptable for long haul fibre 
optic systems, and therefore an external modulator is used with dc current into the laser. 

The process of obtaining a single longitudinal mode requires a grating in the wave- 
guide either in the whole of the gain section which makes it the distributed feedback 
(DFB) laser or at the ends as the grating acts as a frequency selective reflector which 
makes it a distributed Bragg reflector (DBR) laser. This does ensure that a single 
emission line emerges from the laser, and feeds into an external modulator. 


Laser noise 

The semiconductor laser has noise associated with the emission, and the primary source 
of noise is the spontaneous emission, followed by the carrier recombination noise, 
which is essentially shot noise. Since this is in a cavity, the oscillations are affected 
by an amplitude or intensity noise component called the Relative intensity noise (RIN), 
and a phase noise component which affects the linewidth of the emission. The method 
of analysis is to include the noise terms in the rate equations, and traditionally these are 
the Langevin noise components, which added to the rate equations. These noise terms 
are assumed to be Gaussian, with a zero mean, and the correlations are assumed to be 
Markovian, which simplifies the equations. Thus, the rate equations become 


dS 
a (G — Yp)S + Repon + Fs(t) (4.146) 
dN 1 
t= -y¥%M-GS+ Fy (4.147) 
dt qd 
do 1 
Fp Te In) + 5 Be(G — ¥p) + Felt), (4.148) 
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where F(t), Fy, and Fy(t) are the Langevin noise components. The mean of each of 
these components is zero, but their auto- and cross-correlation terms result in so-called 
diffusion components which have specific values. Thus, 


< F(t) >=0 (4.149) 

< K(t), R(t’) > = 2Dyd(t — 1’) (4.150) 

where i, j,k are S, N;,, and it may be shown [2] that the dominant contributions 
are from the auto-correlation terms Dsg = RgpS and Dog = Rsp/4S. The method of 
solution is by perturbations of all three variables; these lead to equations which give 
rise to the auto-correlation factors for the perturbations, and consequently noise spec- 
tral density is obtained through the Fourier transform. Thus, the spectral density of the 


photon number or the light intensity leads to the relative intensity noise. The intensity 
auto-correlation is given by Ags, and thus 


Ass(t) = < 6S(@)8S(t +t) > S2, (4.151) 


where S is the time average photon intensity and 5S = S—S, represents the fluctuations, 
and hence the noise. Fourier transform of Ass gives the RIN 


ee) : 

RIN = / Ass(t)e!@ “dt, (4.152) 

—0o 

and it may be shown that [2] 

2Rspl (CR, + 7) + Gy,S?(1 + yeMi/RspS) — 20N,Gn, 5S] 
S[(Q; — w)? +12] [(Q, +o)? + 12] 


From this expression, the RIN may be plotted as a function of frequency as shown in 


RIN = (4.153) 


Figure 4.19 [2]. The RIN peaks at the relaxation oscillation frequency, and therefore it 
follows that direct modulation of lasers should not be performed close to /;. 
It may also be shown that the signal-to-noise ratio is given by 


rs 


sp 


SNR = ( (4.154) 
Using the phase rate equation with the Langevin noise terms (Equation (4.148)), it may 
be shown [2] that the frequency noise spectral density is given by the expression in the 
following equation. The frequency noise is the phase noise integrated over time: 


204 
Are = Rep 1+ Be we ; (4.155) 
28 [(Q2 — ©)? + 2oP,)?] 


The above expression shows that the term is flat in the region w < &,, and similar to 
the RIN, it peaks at the relaxation oscillation frequency (2,, and then falls off rapidly. 

The linewidth fluctuation is obtained by considering the electric field fluctuations, and 
including both the amplitude and phase fluctuations. After considerable simplifications, 
the FWHM Aw = 27 Af of the line, assumed to be a Lorentzian, is given by [2] 


Rsp(1 + 62) 


Af = 
f Ar $ 


(4.156) 
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Calculated RIN for a typical 1.3 zm InGaAsP laser at different output power levels 
(G. P. Agrawal and N. K. Dutta, Semiconductor Lasers, Van Norstrand Rheinhold, 1993. 
©Springer). With kind permission of Springer Science and Business Media. 


This may be written as 


Af =(1+ BAfo, (4.157) 
where 
Afo = Rsp (4.158) 


which is the unperturbed linewidth of the laser. 
Quantum well and quantum dot lasers 


The gain medium considered so far has been assumed to be bulk material, where gain 
is in the region of 100cm7!. The use of quantum wells or quantum dots as the gain 
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medium results in considerable reduction in the number of carriers injected to obtain 
population inversion and thus the threshold current. 

In traditional double heterostructure lasers, the active region is between higher 
bandgap materials, and the holes and electrons recombine in this region to emit light. 
To obtain stimulated emission, the energy difference between quasi-Fermi levels for the 
holes and electrons in this active region needs to be greater than the band gap of the 
active region. Then population inversion takes place and the light output is dominated 
by stimulated emission. When the active region becomes very narrow, comparable to 
the de Broglie wavelength in the material, quantum confinement occurs, and this region 
becomes a quantum well. 

In a quantum well the energy level in the direction across the well, the x direction 
in Figure 4.20, is quantised, and is a continuum in the other directions, y and z direc- 
tions. The depth of the well in the conduction band is AE, and in the valence band is 
AE,, and the emission energy for the recombination from Elc to Flv is greater than 
(Eg). These levels are calculated by solving the time-independent Schrodinger equation 
of the well with finite barriers for the conduction band electrons, and for the valence 
band heavy holes and light holes. The bulk material degeneracy of the heavy hole and 
light hole bands is removed in the quantum well with the heavy hole band being upper- 
most, followed by the light hole band, and at a lower level, the split-off band as shown 
schematically in Figure 4.21. The effective masses of the heavy and light holes are dif- 
ferent and therefore the levels are also different. The normal wavelength of the lasing 
level is from Elc to Elhh which is a TM mode wave. 

The density of states per unit volume for the quantum well is given by 


TM ¢j 


= 4.159 
fe Lag ( ) 


Pqw-ci = 
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Schematic band diagram of a bulk semiconductor and a quantum well. 


where 7, is the effective mass of the electrons in ith sub-band of the quantum well 
and Lay is the width of the quantum well. The density of states for the valence band is 
identical except that the appropriate effective masses are used. 

The density of states for the conduction band of bulk material, assuming parabolic 
bands, is of the form: 


h2 


2m. 3/2 
pst) = 4 ( ) Be. (4.160) 


Comparing the two, it is apparent that the density of states for the quantum well is 
independent of energy. Furthermore, it may be shown that the density of states for 
the quantum well is much smaller than that of the bulk material. Thus, the current 
density for threshold is generally much smaller than that for bulk material. The cur- 
rent state of the art structures have current densities of the order of 100 A-cm™~? per 
quantum well, in contrast to bulk structures which have threshold current densities 
of almost ten times this figure. The maximum gain of quantum wells is estimated at 
between 1000cm7! and almost 10000 cm—!. However, the confinement factor of a sin- 
gle quantum well is of the order of 0.01—0.02, which makes the net gain of the order 
of 100cm7!. In contrast, bulk structures have gains of the order of 100 cm~!, and con- 
finement factors of about 0.20-0.40. In multiple quantum well structures, the spacing 
between the wells is chosen to be sufficiently large so that the wells are not coupled. 
Typical well widths are about 10 nm, and the barriers between them are also of similar 
widths. 

Quantum well lasers come in different forms, from the single quantum well laser to 
the multi-quantum well laser, and these may be incorporated in the waveguides for lasers 
discussed earlier. Figure 4.22 sketches some of these structures from the graded index 
single quantum well laser to the separate confinement single and multiple quantum well 
lasers. 
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Schematic band diagrams of the graded index single quantum well laser structure, and separate 
confinement single and multiple quantum well laser structures. 


Strained quantum well lasers have the quantum wells with compressive or tensile 
strain, and sometimes the barriers between wells are also strained. The compressive 
strain increases the band gap, and also the separation between the heavy hole and light 
hole bands, and the emission is in TM mode. With tensile strain the band gap is reduced, 
and the gap between the light hole and heavy hole bands is reduced, and may even push 
the light hole band above the heavy hole band. With the light hole band above the heavy 
hole band, the lasers emit in the TE mode. Strained quantum well lasers have lower 
threshold current densities and are sometimes preferred over unstrained well structures. 

Quantum well lasers show better performance compared to the bulk lasers with 
reduced threshold current densities, better linewidths and better chirp performance. 
With the distributed Bragg grating to obtain single longitudinal mode with further 
modifications, these have become the lasers of choice for fibre optical systems. 

Quantum dot lasers are a class of lasers that use confinement to improve on the 
performance of the quantum well lasers. In these devices, the gain is through current 
injection into quantum dots. A quantum dot is a three-dimensional structure in which 
every dimension is less than the de Broglie wavelength in the material, and quantum 
confinement occurs. The density of states in this case is a delta function, and in prin- 
ciple the number of carriers to be injected is very small. Quantum dots are created by 
the Stranski-Krastinov growth technique [5], with further modifications, by allowing a 
strained layer to relax, and this results in self-organised dots. These dots need to have 
wetting layers to make contacts, and since single layer of dots does not provide sufficient 
gain this composite of dot layers and wetting layers need to be repeated many times. 
Lasers made of these active layers show reduced threshold current densities, linewidth 
reduction, and reduced chirp [11] compared to quantum well lasers. The major problem 
with these lasers is that the dots vary in size and so the distribution affects the linewidth. 

Vertical cavity surface emitting lasers (VCSELs) are another class of lasers that use 
quantum wells for the gain medium. In this case, the cavity is usually one wavelength 
long, and the quantum well is placed in the middle so as to provide maximum gain to 
the standing wave within the cavity. Since the gain is so small, the mirror reflectivities 
need to be extremely high, as close to unity as possible. This is obtained by a multi-layer 
dielectric stack mirror with reflectivities of 0.997 for the top output mirror and 0.999 for 
the lower mirror. A schematic diagram of a VCSEL is shown in Figure 4.23. Currently, 
VCSELs have been built in the visible, at 840 nm, 980 nm, and also at the longer wave- 
lengths of 1300nm and 1550nm. These devices may be built with sub-milliampere 


More free ebooks : http://fast-file.blogspot.com 


Fig. 4.23 


Optoelectronics 


Upper contact 
he oa ee i 


Upper mirror 
stack 


Quantum well [Cavity 


Lower mirror 
stack 


eee | 


Substrate 


Lower contact 


Schematic diagram of a single quantum well vertical cavity laser. 


259 


threshold current. These lasers are also one of the most efficient devices with very high 
wall plug efficiencies. The 840nm VCSELs are used in data-communication applica- 
tions where data is transferred between computers, processors and storage media. The 
linewidth of these lasers is generally high, of the order of 0.1 nm, and direct modulation 
of these results in linewidth broadening. These lasers find application in coarse wave- 
length division multiplexed systems, where channel spacing may be as high as 40 nm, 


and in the 10 GHz Ethernet applications. 


High-speed lasers 


High-speed lasers need to have high relaxation frequencies, 2; or f;. The expressions 


for Q; are repeated here: 
Q, = (GGy,S)!”. 


With the substitution for S, 


(I — In) 
J = 
(qG) 


results in 


fadgy 
ae [oes ») 
q 


Also note that 


Tin = Ve Nt-th 
Gn, = Puga/Vol, 
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Plot of relaxation oscillation frequency against square root of normalised current. Reprinted with 
permission from D. Tauber, G. Wang, R. S. Geels, J. E. Bowers, L. A. Coldren, Applied Physics 
Letters, Vol. 62, No. 4, 1993. Copyright 1993, American Institute of Physics. 


the expression for the relaxation frequency becomes 


1+TvgNot (I es 
is TE MENOTD ( 1)| (4.166) 
TpTe Tn 


Equation (4.161) suggests that high values of Gy, are required, which is essentially 
that 0g/0N is high per unit volume, and S' needs to be high, which implies that the 
photon density Npn should be high in the cavity. From Equation (4.163), it follows that 
In should be as low as possible. The small-signal derivation in Equation (4.145) shows 
again that f3ap is determined by Q2,. Thus, this equation may be written as 

V3Q2x 


fap © ae (4.167) 
IU 


Ideally, the quantum dot lasers should have the highest relaxation oscillation frequency. 
However, the problem of the parasitics of the wetting layer and other issues [11] result 
in these lasers not being as fast as the quantum well tunneling injection lasers discussed 


below. 

Of the lasers discussed above, the lowest threshold current device is the VCSEL, and 
early measurements have shown that the relaxation oscillation frequency measured by 
streak camera [31] was 84 GHz shown in Figure 4.24. However, the problem is that the 
parasitics of the device in its present form, with the current passing through the entire 
top and bottom mirrors, are large. Thus, the device parasitics should be made as low 
as possible, as well as the packaging parasitics, as otherwise the modulation signal will 
be attenuated severely at high frequencies. A high-speed VCSEL operating at 35 GHz 
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Plot of the band diagram of the conduction band of the tunnelling injection laser (X. Zhang, 
A. L. Gutierrez-Aitkens, D. Klotzkin, P. Bhattacharya, C. Caneau, R. Bhatt, Electronics Letters, 
Vol. 32, No. 18, pp. 1715-1717, 1996. ©1996 IEEE). 


[6] has the laser drive and modulation current pass through the top mirror but only 
through a small number of layers of the lower mirror to achieve this result. 

The approach by Bhattacharya’s group [35] has resulted in tunneling injection quan- 
tum well lasers that have the potential for very high modulation rates. The claim is 
that when the electrons travel from the cladding and fall into the separate confinement 
region, they gain energy and become hot. The electrons diffuse across the separate con- 
finement region (SC), and when they fall into the quantum wells, they become even 
more hot. The holes having a large mass are not as mobile, and therefore are largely 
unaffected by these changes in potential. The approach is to have a reservoir of elec- 
trons in the SC region, and these tunnel out into the SC region, and also into the quantum 
wells with only small gains in energy, to allow high modulation rates. The theory dis- 
cussed above for (2; does not account for the temperature of the carriers in determining 
its value. A more elaborate theory would do this, but only qualitative explanations are 
therefore available. 

Figure 4.25 shows the conduction band diagram of the device. Careful design of the 
reservoir, its width and its level with respect to the SC region and the quantum wells are 
necessary. Again the device parasitics need to be low. The results are indeed impressive, 
3 dB bandwidths of 100 GHz, and these are probably the fastest lasers built to date. Note 
that the structure used in high-speed quantum dot lasers has also used the tunnelling 
structure, but these are not as fast as the quantum lasers discussed above. 


Photodetectors 


Photodetectors are devices used to convert light signals into electrical versions. The per- 
formance of the different types of detectors is determined by their quantum efficiency, 
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their frequency response and responsivity. In this chapter only solid state detectors are 
considered, and the slower detectors are only briefly discussed. The photomultiplier, 
which is widely used as a sensitive and fast (of the order of 100 MHz) detector, will 
not be discussed here. Solid state detectors considered here are the extrinsic kind, in 
which the photon energy is close to the band gap of the semiconductor material used 
for the detector. Intrinsic photodetectors are used for detection of light with energies 
below the band gap, and depend on deep level traps, or with different energy levels 
in a quantum well, but these are not discussed here. In this chapter, the detectors con- 
sidered are photoconductors, and the junction devices which include the p—i—n diode, 
the metal—semiconductor—metal (MSM) photodetector and the avalanche photodetec- 
tor (APD). The photoconductor is a slow device and also noisy, but its simplicity is an 
attractive feature. 


Preliminaries 


Consider a semiconductor photodetector which absorbs photons with energies at or 
above Eg, the bandgap energy of the semiconductor. Suppose the incident optical power 
is given by Pin, and it is assumed that all the incident photons enter the semiconductor. 
Suppose the photocurrent generated as a result of this incident optical power is given by 
Tp, then the relationship between Pi, and /, is 


Ip = RPin, (4.168) 


where R is the responsivity of the photodetector in units of AW!. 
The quantum efficiency of the detector 7 may be defined as ratio of the number of 
hole—electron pairs generated to the number of incident photons, and is given by 


Ip 
qd hv 

n= = —R. (4.169) 
Pn q 
hv 

Thus, R may be written as 
Xr 

Rate: (4.170) 

hv he 


Suppose the thickness of the semiconductor is w, and the absorption coefficient is a 
Np m_!, then the transmitted optical power escaping from the semiconductor is given by 


Py = Pine. (4.171) 
Thus, the absorbed power is given by 
Paps = Pin — Py = Pin (1 — e*™). (4.172) 


Since every absorbed photon creates one hole—electron pair, the quantum efficiency is 
given by 
Pabs —aw 
= =(l-e™"), (4.173) 
Pj 


which assumes that all the incident photons enter the semiconductor with no reflection. 
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Photoconductor detectors 


The photoconductor detector depends on the increase in conductivity of a semiconduc- 
tor when illuminated with photons of energy above the band gap. The absorbed light 
creates hole—electron pairs, which increases the conductivity, and with an applied bias, 
the excess carriers drift to the appropriate electrodes, and constitute an increase in cur- 
rent. Holes and electrons created by the light may be swept out before they recombine 
by the applied bias field. Alternatively, they recombine as they drift towards the appro- 
priate electrode. The electrons are swept out faster than the holes, and to maintain charge 
neutrality, more electrons are injected, and this constitutes gain. 

A typical photoconductor detector takes the form of a slab of material of thickness a, 
width b and length L, with ohmic contacts at the sides as shown in Figure 4.26. 

The dark current flowing in the slab is given by 


V 
T= qab(nun + Pip)» (4.174) 


where n and p are the free electron and hole number densities, 4) and jp are the mobil- 
ities of the electrons and holes respectively, and V is the applied voltage across the slab. 
When illuminated, the conductivity increases due to the electron-hole pairs created by 
the photons. Thus, the current when the slab is illuminated is given by 


4 
(+ Al) = qba[(n + An)py + (p + Ap)upl-» 
where I is the dark current, assumed to be small. The increase in current due to the 
illumination is 


(4.175) 


V 
AI = qab(Anpn + Apltp) = (4.176) 


Suppose the minority carriers are electrons, then the rate equation for the electrons takes 
the form: 
dAn nP An 
dt  hvabL 1’ 


(4.177) 


where 77 is the quantum efficiency, which determines the number of hole—electrons gen- 
erated per photon, usually taken to be unity, and P is the optical power absorbed. Note 
that an equal number of holes as electrons are generated, but remain as the majority 
carrier. Under steady state, the time variation is set to zero, and then 


Photons 


Schematic diagram of a photoconductor slab: length between contacts L, width b and 
thickness a. 
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ee (4.178) 
hvabL 
This assumes that the photon-generated hole-electron pairs are in the volume of the slab, 
and the volume is abL. The electron current due to these excess electrons is given by 
qnPG 
hv 

where G is the gain of the device. Substituting for An from Equation (4.178), G is 
defined as 


V 
Aly = qAniinba = (4.179) 


TV T 
= ae (4.180) 
iE tirn 
where the electron transit time fr.) is given by 
L L ee 
ten = — (4.181) 


Vy ME fin 
In addition to the electrons, adding the motion of the holes results in this equation 
becoming 
_ (Hn + Mp)tV — ent Mp T 
~ Li? 7 Ln tern , 
Note that since 4p < /Lp, the expression for gain is that given in Equation (4.180). 
Making L as small as possible increases the gain and also reduces the response time. 
If the carriers are swept out before they recombine, then the electrons reach the ohmic 
contact before the holes. To maintain charge neutrality, extra electrons are injected into 
the photoconductor. If sweep out of the carriers occurs, then the lifetime is the transit 
time of the carriers and is given by 


G 


(4.182) 


| es 
En,p (¥) En, pV 
Since the mobility of holes is lower than that of electrons, the transit time of electrons 


is smaller. If up <K fn, then the gain term is dominated by the electron mobility term 
and in turn the transit time 


(4.183) 


tirnp = 


Ga (4.184) 
ten 
Thus, the photon-induced current from Equation (4.179) becomes 


t qnP 
f= A = —— = 4.185 
ph n feta hv ( ) 


In Equation (4.177), the time variation of An may be as e/“', and in the small-signal case 
the response time varies inversely as (1 + j@t), which defines the bandwidth. Thus, if 
the optical power is given by 


P(@) = Pop + Pie, (4.186) 


then the ac current term is 
qnP, T it 


Se (4.187) 


hv tenl+ jor 
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The rms current magnitude is given by 

qnP, Tt 1 

VJ 2hv ten (1 + @0?)1/2° 
Noise in these detectors is from several sources and these are thermal or Johnson noise, 


generation-recombination noise, and at low frequencies, the 1/f flicker noise. If the 
resistance of the device is Reona, then the thermal noise current is given by 


(i nl?) _ 4kpT Af 
t = > 
Reond 


A Tac rms — 


(4.188) 


(4.189) 


where Af is the bandwidth. The generation-recombination noise term is given by 


4qGIhpAf 

2 te) 

= ——. 4.190 
(InoRl?) = FOS (4.190) 
where Jo = g7PopG/at. Neglecting the contribution of 1/f noise at low frequencies, 
the signal-to-noise ratio is given by 


S Al... 
= = ; (4.191) 
N (ttn?) + (ltr?) 
p2 kpT i 
eee | E B (ter )2 "| (4.192) 
8hv Pon Af Gq Racal 


The noise equivalent power (NEP) is defined as the incident rms optical power required 
to produce a signal-to-noise ratio of unity in a bandwidth of 1 Hz, [3], and the detectivity 
of a detector is the inverse of NEP or D=(NEP)~!. The normalised detectivity D* is 
defined as 

—_ AlACAsy? 
NEP 


where A is the area of the photoconductor on which light is incident. The bandwidth 
is usually set to 1 Hz, and the reference area is set to 1cm*. Note that D* is usually 
expressed as D*(A, f, 1), where A is the wavelength of light, f is its frequency of 
modulation, and 1| is the bandwidth in Hz and may be obtained from Equation (4.192). 

The structure of these devices may take various forms, including the interdigitated 
surface contact structure shown in Figure 4.27. 


(cm.Hz'/? W-'), (4.193) 


P-I-N diodes 


Detection of photons with energies at or above the band gap of a semiconductor requires 
that they are absorbed and create hole—electron pairs, and a current be induced due to 
this absorption. The depletion layer of a reverse-biased p—n junction of the semicon- 
ductor causes the holes and electrons to separate and be collected by the appropriate 
contact/collection region. The photons entering this device, schematically shown in 
Figure 4.28, create hole—electron pairs as the light is absorbed, from the top contact 
region, through the depletion layer on both sides of the junction and possibly beyond, to 
the lower contact region. The fields in the depletion layer are high enough to separate the 
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Schematic diagram of a photoconductor detector with interdigitated fingers. 


Photons 


Depletion layer 


Schematic diagram of a p—n junction photodiode. 


holes and electrons, and send them to the respective majority carrier region, holes to the 
p region and electrons to the n region, because of the reverse bias. Holes and electrons 
generated in the p and n contact regions need to be considered differently. The minority 
carriers, electrons in the p region and holes in the n region, about one diffusion length 
from the depletion layer, diffuse towards the depletion layer and are accelerated to the 
appropriate majority carrier region. This diffusion is a slow process, which degrades the 
response of the diode detector. 

The alternative to the simple p—n junction detector is to use a p—i-n structure, the ‘1’ 
region is either an ‘i’ layerorap™ orn layer. In this device, the depletion layer extends 
through the whole of the ‘i’ region with no bias or with a negative bias. The usual top 
p~ layer is made thin to ensure that absorption in the top contact layer is negligibly 
small. Most of the absorption is in the ‘i’ or n~ layer, with some in the lower n™ layer, 
as shown in Figures 4.29 and 4.30. 

Suppose the photon flux per unit area is given by ¢o, and the absorption coefficient is 
given by a, then the hole—electron generation rate is given by 


Gen(z) = goae °7, (4.194) 
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Schematic diagram of a p—n junction photodiode in mesa form, with a ring top contact. 
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Schematic diagram of a p-i—n junction photodiode. 


and the photon flux density ¢o is given by 


_ Pine _ R) 4.195 


where the incident optical power is given by Pinc, R is the reflectivity of the surface 
and A is the device area. Note that the extra a introduced in the generation term is to 
normalise the current density. The drift current density is given by 


w 
Jarift = -a f Gen(z)dz = qdo1 —e %%). (4.196) 
0 


The tail end of the optical power also enters the lower n* region, and generates hole— 
electron pairs. In this region, the hole, minority carrier, motion is determined by the 
diffusion equation 


d7Pn Pn — PaO 
az2 if 


Gen(z) = 0, (4.197) 


where Dp is the diffusion coefficient for holes, tp is the lifetime for holes beyond 
Puno, the equilibrium hole density for the n+ doped layer. The boundary conditions are 
Pn=Pno for z = oo and py = O for z = uw; the former has some validity since the 
structure is generally grown on an n* substrate. The latter boundary condition has the 
excess hole density to be zero since those generated at the boundary are acted upon by 
the depletion layer and accelerated away. The solution is given by 
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Pn = Pno — (Pao + Ce ev 4. Ce, (4.198) 


where Ly = ,/ DpTp and 


2 
[laces (4.199) 
Dp (1-223) 
The diffusion current density is 
0 
Jaitt = —qDp (2) (4.200) 
aLp —aw Dp 
= —, 4.201 
a (ary i a ( ) 


and the total current density is the sum of the drift current density and the diffusion 
current density, and is given by 


J fe ag (4.202) 

tot = 90 (ae, qPr0 Le 

The value of po is small in the nt region, and therefore under illumination, this last 

term is small and is usually omitted, and the current is proportional to the photon flux. 
The quantum efficiency may be obtained from these expressions: 


Ji A —aw 
ja =k (1 7 ) (4.203) 
Popt/ hv 1+aLp 


where Pop: is the optical power absorbed and R; is the reflectivity. To make the quantum 
efficiency high, the diode reflectivity needs to be made small using an anti-reflection 
coating, so that R; © 0. Also ww > 1 makes next closer to unity but at the expense of 
transit time delay, and hence the frequency response becomes small. 

The frequency response of the photodiode is governed by several factors, and these 
are discussed below. The diffusion of the minority carriers generated outside the deple- 
tion region and their transit to the appropriate electrode region are major limitations 
to the response time of the diode. However, careful design may minimise this effect, 
including the use of heterostructures so that the p* and the n* regions are in higher 
bandgap material, which prevents the generation of hole—electron pairs in these contact 
regions. The RC time constant of the diode, where C is its capacitance and R is shunt 
resistance added to extract the signal, determines the circuit response, and is minimised 
by suitable choice of R, and the area of the diode. The carrier transit time across the 
depletion region is also a major factor, and this is analysed below. 

Suppose the incident optical flux is modulated to have a time varying component of 
the form ¢,e/°"’. At any point z in the depletion layer, the carriers generated have to 
drift to the appropriate electrode region at the saturation velocity vs. Thus, the conduc- 
tion current density due to the carriers generated at z has a phase delay of e~J®s2/"s and 
is given by 


Jeona(z) = qgieme ea", (4.204) 
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Plot of the transit time factor in Equation (4.206) against the product of the transit time and the 
modulation frequency (A. Yariv, Optical Electronics in Modern Communications, Fifth Edition, 
Oxford University Press 1997. Figure 4.1 1—4.17. By permission of Oxford University 

Press, Inc.). 


Considering the carriers generated over the entire depletion layer, neglecting the 
absorption coefficient, the total conduction current density is given by 


1” 
Jeond = ~ | qgielom™e JOZ\sdz (4.205) 
w Jo 
l= —jomtir P 
= 991 (=) ae (4.206) 
tr 


where f,; is the transit time equal to w/vs. In addition to the conduction current density, 
the displacement current density is given by 
JOmeV 


Jaisp = jOmeE = , (4.207) 
w 


The total current density is the sum of these two terms. The conduction current density 
is reduced by the transit time factor as shown in Equation (4.206). Figure 4.31 shows a 
plot of this factor (1 — e J@mtr) /(@mttr) against @mftr. Note that when the denominator 
becomes 277, this factor goes to zero. 

The expression for the transit time effect in Equation (4.206) becomes 1/./2 when 
the value of @mftr is 2.4, and also has a phase shift of 27/5. Thus, the 3 dB response is 
given by 

2.4 0.38205 
ty w 


Lae = 0.3820, foraw=1. (4.208) 


Note that including the effect of the absorption coefficient in the above derivations, 
Equations (4.204)-(4.208) in fact reduces the transit time factor further, and therefore 
these results are optimistic. 
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Equivalent circuit of a p—i-n photodiode. 


The equivalent circuit of the p—i—n photodiode is shown in Figure 4.32, and consists 
of a current generator, the diode incremental resistance Rg in parallel, the diode capac- 
itance C also in parallel, the series resistance Rs, which includes the contact resistance 
and the load resistance Ry also in parallel. The inductance of the wire bond or contact 
line in series with Rg has been omitted. Note that Ry is generally small compared to Rg. 

The analysis of this section shows that to make frequency response of the photodi- 
odes high, the capacitance of the diode should be low so that the RLC time constant is 
small and the ‘i’ layer thickness should be between 1/a@ and 2/a, where a is the absorp- 
tion coefficient, and this comes at the expense of responsivity. Using higher bandgap 
materials for the pt and n* regions allows higher fields to be used as the breakdown 
voltage is increased. The usual choice is to make RC time constant equal to the tran- 
sit time across the depletion region. To reduce the capacitance, the area is made small, 
of the order of 501m? or less. High-speed p-i—n diodes have been designed and built 
for many years, and currently diodes with responses in the 50 GHz region are available 
[10]. To reduce the transit time effects, the absorbing layer is made as thin as possible, 
of the order of 0.15 1m, and this is at the expense of responsivity. To overcome the 
reduced absorption in the thin ‘i’ layer, the use of a dielectric mirror above the contact 
layer and an epitaxial mirror below the absorbing layer causes the light to have multiple 
passes, and improves the responsivity [10]. The problem of having this in the n* layer 
requires careful design of the doping of the heterostructure mirror, since current has to 
flow through it. Note that the lower mirror is similar to those used in VCSELS and there 
the mirror series resistance problem has been alleviated considerably. 

Heterostructure diodes in which the p* and n™ regions are of a larger bandgap mate- 
rial, with the absorption ‘i’ or n~ layer, have also been designed. The advantage of this 
structure is that the incoming light is not absorbed in the heavily doped regions, and the 
depletion fields may be higher. However, the heterojunctions may have to be graded to 
prevent carrier trapping. 

The noise output of the p—i—n diode is dominated by the thermal or Johnson noise. 
Then the input signal photocurrent is given by 

Ion = qinPin _ pp. (4.209) 
hv 
where R is the diode responsivity in A W~!. The shot noise is due to the background 
optical power generating a current given by Jp, the dark current Jy due to the reverse- 
biased p-i—n diode and the photocurrent Jpn, and is given by 


(ios?) = 2¢Upn + Ip + lad AS. (4.210) 
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The diode is usually connected to a preamplifier, noise figure F,, input resistance Ra, 
which is in parallel with the diode output resistance Rg and the load resistance across 
the diode Ry, and neglecting the series resistance R, which is small, the equivalent 
resistance given by 


—=(E+E+E)*Z (4.211) 
Reg \Ra Ri Ra Ry’ 
and the corresponding Johnson or thermal noise current squared, including the pream- 


plifier noise figure, is given by 


AkyT Fy 
(Ioi?) = 2s (4.212) 
Req 


Thus, the signal-to-noise ratio is given by 


(atti 
2 ny (4.213) 


AkpT FaAf 
2q Upn + Ip + La) Af + — — 


Zzln 


eq 


Usually, the shot noise current is small, Req is approximately equal to R:, and the 
signal-to-noise ratio becomes 
qNint Pin ‘ 
S hv 


7 ee (4.214) 


Ri 


Thus, to make the signal-to-noise ratio large, RL, needs to be made as large as possible, 
but the problem here is that the RC time constant then becomes large and the response 
time becomes slow. Thus, the values of Ry are of the order of 50-100 Q to ensure that 
the RC time constant approaches the transit time limit. 


Avalanche photodiodes 


The p-i—n diode has no gain, and adding gain enhances its performance. Increasing the 
reverse bias close to or at the breakdown of the diode, the electric field in the ‘i’ or n7 
region becomes high, and results in the carriers being accelerated to a higher velocity 
before a collision with the lattice occurs. If the velocity is high, then this collision may 
be inelastic and causes ionisation, resulting in an additional electron and hole being 
generated. This additional electron and hole together with the original electron are also 
accelerated in turn to have further collisions to create additional hole—electron pairs. 
A schematic diagram of this process is shown in Figures 4.33 and 4.34 in which an 
electron is injected, at the start of the high field region of the depletion layer. This 
figure assumes that the ionisation coefficient for the electrons we is equal to that for 
the holes a. These coefficients are probabilistic, and are the reciprocal of the average 
distance that the carrier travels before an ionising collision occurs. These coefficients 
are a function of the electric field, and vary with the material parameters. In Silicon, 
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Schematic diagram of the avalanche process with injection of an electron into the avalanche 
region, which after collision creates an additional hole—electron pair, which, together with the 
original electron, is also accelerated to have further ionising collisions, resulting in additional 
hole-electron pairs for each carrier. In this diagram, the ionisation coefficients of the electrons 
and holes are assumed to be nearly equal. 
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Schematic diagram of the avalanche diode. 


Qe > Qh; but in II-V material, GaAs and InP and others they are almost equal. The 
collisional increase in carriers results in an increase in the current which implies current 
gain. 

The expression for the ionisation coefficient is given by [3]: 


Ce,h = Ago EXP |- (5) ; (4.215) 


In GaAs, dog is ~1.3 x 10°cm™!, b is 2 x 10° Vcm™! and m is 2. The multiplication 

coefficient or factor M is the ratio of the output current density to the input current 

density, and may be subdivided for the electron component and the hole component. 
Suppose that the total current density in the diode is given by 


Stor = Je(x) + Jn(x), (4.216) 
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where J.(x) is the electron current density, J,(x) is the hole current density and Jiot is 
a constant at any plane of the diode. The multiplication factors are defined as 
Je,out J.(w) 


M. = = (4.217) 
eden da) 


for the electron current density, and 


Shout = Jn (0) 


My => = 
Shin Jn(w) 


(4.218) 


for the hole current density. 

Suppose a p—i—n diode with an ‘i’ region is considered, the field across the ‘i’ region 
is uniform. Also assume that the ionisation coefficients ~_ = ap, and are constant at 
the fixed electric field in the ‘i’ region, and the width of the avalanching region is w. 
The holes and electrons are accelerated in opposite directions, and the collisions result 
in ionisation and multiplication. Assume that at x = 0, there is only electron injection, 
hole injection is zero at x = w, and therefore the avalanching is initiated by electrons. 
In this case, the multiplication factor is given by 


M = 1+ ew + (aew)” + (ew)? + (aew)* +--+ (4.219) 
1 
= —— foraew < l. (4.220) 
1—aew 


This final result implies that the multiplication factor goes to oo when @ew is unity, or 
that each carrier has one ionising collision over the distance w. 

For the case of the p—i—n, with an n~ region instead of the ‘i’ region, the field 
increases linearly from the n* region to the pt region, and the above assumptions 
of constant ionisation coefficients no longer hold. The total current density flow- 
ing in the diode is the sum of the electron current density and the hole current 


density. 
The current densities in the avalanching region satisfy the following equations: 
dJe 
ie AeJe + anJn + GGen(x) (4.221) 
x 
dj 
——S* = ede + andy + gGen(x), (4.222) 
x 
where qGen(x) is the optical generation rate. Subtracting these two equations results in 
d 
—[e(x) + Jn(x)] = 0, (4.223) 
dx 
or 
Je(x) + Jn(x) = constant = Jiot- (4.224) 


This implies that the current density is a constant at any plane in the diode, as claimed 
earlier in Equation (4.216). Substituting for J, from Equation (4.216) by Jtot — Je in 
Equation (4.221) 
dJe 
dx 


— (de — Ah) Je = AnJtot + GGen(x). (4.225) 
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Consider the case when a, = dp, neglecting the Gen term, this becomes 


dJ, 
eh Qe Jtots (4.226) 
dx 
and the solution is 
WwW 
Je(w) = Jus | aedx + J.(0). (4.227) 
0 


Assuming that there is no hole injection at x = w, then Je(w) = Jtot, and this equation 
becomes 


WwW 
Je(w) = Jew) | aedx + J.(0). (4.228) 
0 
Dividing by J.(0), this equation becomes 
Je(w) / * 
=M.=M., d 1, 4.229 
J.(0) e e ‘ edX + ( ) 
or 
M. = : (4.230) 
fe 6 wede , 
For avalanching, 
Ww 
/ dedx = 1. (4.231) 
0 


When a, 4 dp, then Equation (4.225) needs to be solved. This is first-order differential 
equation of the form: 


dy 
—+ P(x)y = O(n), (4.232) 
dx 


and the solution is given by 


y= LOE el PAVE" + yO) 


Eee (4.233) 


Consider the case of electron injection at x = 0, with Gen(x) = 0 and no hole injection 
at x = w, which implies that J;(w) = 0, and Jinn = Je(w). Following the above 
procedure, with these boundary conditions, the solution of Equation (4.225) is obtained, 
and the electron current multiplication factor becomes [8]: 


Je(w) el (te—an)dx’ 
e= = 7 me (4.234) 
J.(0) 1 ie dx! ay (x!el’ (Qe—Opy dx 
When a, = dh, then this equation becomes, as shown in Equation (4.230) 
1 
Me (4.235) 


1- . aed x 


When the denominator of these multiplication factors is zero, the diode avalanches. A 
similar expression to that in Equation (4.234) may be derived for Mp, if the avalanche 
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is initiated by holes. For this case, it is assumed that at x = w, J.(w) = 0, Gen(x) = 0 
and J;(0) = Jtot, and the variable is J, (x) to give 


Jn(w) en Ip @e—an)dx' 


= = 7 ; (4.236) 
J.(0) i= ic dx!e(x!e~ So (ae—aty dx” 


and again the avalanching occurs when the denominator goes to zero. The avalanching 
is dependent on the field in the depletion layer, carrier population and the collision 
frequency, and not dependent on the carrier initiating the avalanche process. By careful 
control of the bias, the avalanche gain may be controlled to values as desired. 

Notice that the generation term has been omitted in these expressions, but may be 
included as necessary. Silicon avalanche photodiodes (APDs) may use the absorption 
region as the avalanche region. In this case, the derivations above with the genera- 
tion term would be the appropriate equations. Near infrared wavelengths have used 
GaAs/AlGaAs APDs, and for wavelengths in the telecommunications band, 1300 nm 
and 1550 nm bands, the absorption layer is generally InGaAs which is lattice-matched 
to InP. The avalanche region for these diodes is separate and usually InP, since large 
leakage currents occur due to tunneling with high reverse bias in InGaAs. This type of 
diode is termed the separate absorption and multiplication (SAM) APDs [26], and is 
currently the usual APDs at these wavelengths, as shown schematically in Figure 4.35. 
Accumulation of holes occurs in the InP—InGaAs valence band junction region, and a 
graded junction alleviates this problem [22]. 

The multiplication factor is a random variable and therefore the excess noise due to 
this avalanche process may be estimated by a noise factor Fy. This factor is a function 
of the multiplication factor M and ka, where Ka is the ratio of the ionisation coefficients 
and ka lies in the range 0 < ka < 1. Thus, ka = ah/ae, provided a, < ae, or 
alternatively ka = Qe/ay, provided a. < ap. The expression for the noise factor is 
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Schematic diagram of the separate absorption and multiplication avalanche photodiode, also 
showing the field in the different regions. 
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Excess noise factor Fa as a function of the multiplication factor M and the ratio of the ionisation 
coefficients ka (G. P. Agrawal, Fiber-Optic Communication Systems, 3rd edition, John Wiley & 
Sons, 2002. ©2002 John Wiley & Sons). Reprinted with permission of John Wiley & Sons, Inc. 


given by [23]: 


1 
Fa =kaM + (1 — ka) (2 = i) : (4.237) 
Figure 4.36 shows the excess noise factor Fa plotted against the multiplication factor 
M, as a function of ka, from the Equation (4.237) [1]. 
The responsivity of the avalanche photodiode includes the multiplication factor M 
and is given by 


Rapp = ue. (4.238) 
hv 


and therefore the input signal photocurrent is given by 


UT) 
Toh, APD = Rapp Popt = M re Pope (4.239) 


The shot noise current squared of the avalanche diode is also enhanced by the 
multiplication factor and the noise factor and is given by 


(Io.17) = M? Fa(RappPin + Ip + Ip) AS, (4.240) 


where [p is the dark current and Jp the background current. The thermal noise current 
squared is given by, as in Equation (4.212), including the amplifier noise figure Fy, 


AkyT Fy 
(icit?) Se (4.241) 


Req 
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Hence the signal-to-noise ratio is given by 


Manint Pin js 
ae hv 


2qM? Fa(Iph +ip+ 1)Af + 


tT AF (4.242) 


Req 


In this case, the shot noise term dominates and therefore the signal-to-noise ratio 


becomes 
Mq Nint Pin 2 
S hv 


N 2qM?Fa(Iph + Ip + La AS’ 


(4.243) 


The APD is the preferred photodiode, but the introduction of the gain stage reduces 
the frequency response. However, recent results have shown that these devices have 
gain bandwidth products of over 300 GHz, with a response of 28 GHz [18]. Lower gain 
results show a higher response of over 30 GHz [29]. These devices are much more 
expensive than p—i—n diodes, although the performance is better because of the gain 
that arises due to avalanching. 


Metal—semiconductor—metal detectors 


A planar version of the p-i—n diode is the metal—-semiconductor—metal (MSM) detector, 
in which the contacts are metal Schottky barrier diodes to a thin undoped semiconductor 
layer, and the region between the metal contacts, usually in the form of an interdigitated 
structure, shown in Figure 4.37, is completely depleted. The gap between the fingers is 
made small, and the transit time limitation is small. The capacitance has two compo- 
nents: one in parallel with the gap across the fingers, and the second is the capacitance 
to the ground of both electrodes. However, this latter may be made small so that the RC 
time constant of the diode is small. Consequently, these detectors are extremely fast, 


Interdigitated form of the MSM detector, in which the contacts are Schottky diodes to the thin 
epitaxial absorption layer. 
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Capacitance against the edge length of the fingers of the interdigitated MSM device for gaps of 
1, 2 and 3 .m spacing, with finger widths of 0.5 and 1.0 pm. The parallel plate capacitance of 
the p—i—n diode is plotted for comparison, with the ‘1’ or n~ layer thickness of 0.5 and | wm. 
The electrode width for the p-i—n diode has not been given in the paper (J. B. D. Soole and 

H. Schumacher, /EEE Journal of Quantum Electronics, Vol. 27, No. 3, March 1991. ©1991 
IEEE). 
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Response of the MSM detector showing the detected photocurrent against the bias voltage for 
different values of light intensity for the 1 jm electrode width and a 2 1m interelectrode gap 
device (J. B. D. Soole and H. Schumacher, JEEE Journal of Quantum Electronics, Vol. 27, 
No. 3, March 1991. ©1991 IEEE). 


but their responsivity is affected by the shadowing effect of the fingers. However, it was 
found that the quantum efficiency increases with increasing bias, which suggests that 
there is some gain that arises due to trap densities at the interfaces between layers, at 
electrode interfaces. However, [30] suggests that with careful growth this gain may be 
much reduced as seen in Figure 4.39. Note that one of the diodes is forward-biased and 
the other is reverse-biased, and again careful processing results in symmetric response 
and soft breakdown. 

MSM photodetectors of InGaAs on an InP substrate [30] for operation at 1.3 jm 
and 1.5 1m wavelengths are discussed here. The Schottky barrier on InGaAs is of the 
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Quantum efficiency of the InGaAs MSM device the electrode widths of 1 jt and different 
spacings from 1, 2, 3 jm inter-electrode spacing. The zero width electrode result is also plotted 
here (J. B. D. Soole and H. Schumacher, /EEE Journal of Quantum Electronics, Vol. 27, No. 3, 
March 1991. ©1991 IEEE). 


order of 0.2 V, and is leaky in reverse bias, and therefore is enhanced by a layer of 
lattice-matched InAlAs under the electrodes. The thickness of the InGaAs layer was 
1.3 jm, and the InAlAs layer was 80 nm. The capacitance between the fingers has been 
calculated by means of the usual Schwarz—Christofel transformation in this paper, and 
it is shown that the edge capacitance of the fingers is much less than the capacitance of 
comparable area mesa-type p—i—n diodes, as shown in Figure 4.38. Provided the series 
resistance is comparable, the RC time constant is much less for the MSM detector. 

The device with a | jum wide electrode with the interelectrode gap of 2 ~m between 
them has a response shown in Figure 4.39 of the detected photocurrent against the bias 
voltage across the electrodes for different values of light intensity on the device. Note 
that the response is not flat but rises, indicating that there is gain in the device; this 
may be in part due to the photoconductor effect and also may be at the higher bias due 
to avalanching near the electrodes. The quantum efficiency of the MSM detectors is 
hampered by the shadowing effect of the electrodes and this is plotted for this InGaAs 
device in Figure 4.40, which shows that this may rise to as high as about 70% for the 
1 jm electrode width and a 2 \.m interelectrode gap device. The bandwidth of the device 
is in the 20 GHz region, and other devices on GaAs show similar or better frequency 
response. 


Travelling wave p-i—-n photodiodes 


As discussed above, the frequency response limitation of p—i—n diodes arises from 
the transit time of photogenerated carriers reaching their respective electrode contact 
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regions, and also from the RC time constant of the device. Making the absorption 
layer small, 0.3 jum, results in poor responsivity; placing mirrors underneath to obtain 
a double pass improves the response, but the RC time constant rises. The MSM device 
reduces the RC time constant but the transit time limitation remains. The second lim- 
itation is the device saturation that occurs when the light intensity becomes high. The 
alternative is the waveguide photodetector (WPD), which is an edge-fed p—i—n diode 
that is made very narrow and long, and the absorption layer may be thin to overcome the 
transit time limitation, making it highly efficient. However, the RC time constant limits 
the device response to about 55 GHz bandwidth-efficiency value (product of bandwidth 
and quantum efficiency) [16]. 

To overcome the problem of the RC limitation of the waveguide detector, and also 
be able to handle high power detection, the travelling wave detector was proposed by 
[8, 32]. An optical waveguide with a thin absorption layer is the basis for this device. 
The absorption layer is made thin enough to only absorb a small fraction of the light 
per unit length so as not to saturate the device, thus high power signals may be detected 
without saturation. However, the limitation of the travelling wave detector arises from 
the velocity mismatch between the electrical wave on the electrode structure and the 
optical waveguide. The electrode on top of the guide with the accompanying ground 
electrodes on the sides form a coplanar waveguide, in which the detected signal trav- 
els in the form of a voltage/current wave, see Figure 4.41. The optical signal travels at 
the guide layer group velocity, usually at c/ng, and ng is the group index of the guide 
layer. The detected signal forms electrical forward and backward waves on the elec- 
trode structure. The backward wave reflects at the input of the electrode structure, if it 
is an open circuit at the start of the detector, or alternatively, the wave is absorbed in 
the load if matched. Thus, the electrical wave travelling on the electrodes is a combina- 
tion of the forward wave and the reflected component of the backward wave travelling 
in the forward direction, and also the backward wave traveling in the reverse direc- 
tion. The lack of velocity match between these two optical and electrical waves leads 
to walk off, and the frequency response drops [15]. However, bandwidth of 172 GHz 
and bandwidth-efficiency product of 76 GHz [12] and pulse response with transform 
bandwidth as high as 560 GHz [28] have been reported. Careful choice of the absorb- 
ing layer, absorbing coefficient of ao, and its thickness and location in the waveguide 
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Schematic diagram of a travelling wave photodiode, with coplanar electrodes (G. Rengel-Sharp, 
R. E. Miles, S. Iezekiel, The Radio Science Bulletin, Vol. 311, No. 12, pp. 55-64, December 
2004. ©2004 URSI). 
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Schematic diagram of a travelling wave photodiode, showing the p-i-n structure with the 
absorbing layer, and the electrodes. 


allow the confinement factor I to be determined, and thus the decay factor becomes 
e 0! for the optical wave intensity. These types of travelling wave devices are the fully 
distributed detectors. A second design has used a passive guide between MSM detec- 
tors [20], in which each detector only detects a small fraction of the power carried by the 
optical wave. Velocity matching allows the 3dB frequency response to rise to 49 GHz. 
Both these types of travelling wave devices are the fastest photodetectors built at the 
present time. The usual optical waveguides are of the ridge type, with the top electrode 
and adjacent ground electrodes of the coplanar waveguide (CPW) type, as shown in 
Figure 4.41 [27]. 

The theory of the distributed detector has been developed by [12] and [15]. Essen- 
tially, the p—i-n diode has to support the optical mode, and the electrode structure 
supports the electrical mode. Figure 4.42 shows a typical structure of this travelling 
wave photodetector. 

The optical mode travels in the central p-i—n region, and the electrical mode is sup- 
ported by the electrode structure transmission line shown in this figure. The equivalent 
circuit of the elemental section of the line is now the usual R—L in series and C-G in 
parallel, but now with the elemental p—i—n structure in parallel; the modified circuit is 
shown in Figure 4.43. Note that the usual capacitance is that of the depletion layer, and 
therefore is across the current source. 

In this circuit, Repw is the series resistance of the central electrode and varies due 
to the skin effect, increasing as ./f. The semiconductor series resistance Rsemj is in 
series with the depletion layer, and is determined by the doping levels of the pt and 
n* layers, shown as p and n in Figure 4.42. The capacitance of the central electrode is 
dominated by the depletion layer capacitance of the p—i—n diode, which is much larger 
than the edge capacitance. The depletion layer capacitance gives rise to a slow wave 
effect, similar to that suggested by Hasegawa [14]. The inductance is the usual CPW 
value, and is unaffected by the presence of the p—i—n structure. The series resistance Gg 
is the top contact layer in parallel with the electrodes. The distributed current source is 
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Schematic diagram of the equivalent circuit of a travelling wave photodiode (G. Rengel-Sharp, 
R. E. Miles and S. Iezekiel, The Radio Science Bulletin, Vol. 311, No. 12, pp. 55-64, December 
2004. ©2004 URSI). 


related to the input optical power P,, and is given by 


Tape) % e~ JB oz, 


ie 
Inte) = Po (4.244) 
he 


where A is the wavelength of the optical power, a is the absorption layer coefficient, T° 
is the confinement factor in this absorption layer, 6, is the optical propagation constant 
and 7 is the quantum efficiency of the distributed p—i—n diode. The terminations at the 
input may be an open circuit or a matched load, and at the output is matched to extract 
the signals on the electrical wave. This equivalent circuit may be used to analyse the 
performance of the travelling wave detector. 

The major concern in the treatment of the travelling wave detector is the velocity 
mismatch between the optical wave and the electrical wave on the electrode structure. 
Giboney ef al. [12] have analysed the transmission line using an impulse excitation, 
to account for the velocity mismatch. Performing the Fourier transform, the following 
expression for the normalised photocurrent is obtained for the case when Ta ,f « | for 
the frequency w: 


1 
1)(@) = of + yo) eiotive), (4.245) 
2 | wp — jo QO, + j@ 
where 
Taove 
of = ——— 
1—ve 
( Vo ) 
and 
T'dove 
oO, = ———~. 
* 1+ v¢ 
Vo 
y is the electrical reflection coefficient at the device input, y = 1 for an open circuit 


and y = 0 for matched load. The term ve is the electrical wave velocity and vo is the 
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Normalised current response for both the velocity-matched and -mismatched cases for both the 
open circuit and the matched terminations at the input (G. Rengel-Sharp, R. E. Miles and 

S. Iezekiel, The Radio Science Bulletin, Vol. 311, No. 12, pp. 55-64, December 2004. ©2004 
URSD. 


optical wave velocity. The results of this calculation are shown in Figure 4.44. Notice 
that the normalised current for the matched case has no frequency variation, but since 
half of it goes to the input-matched load, the output current is reduced by half. For 
the velocity-mismatched case, for y = 0, again the frequency variation is lower than 
for the case when the input has an open circuit, when y = 1. The open circuit case 
has considerable frequency variation, but the current is higher at all frequencies for the 
velocity-matched case, and becomes asymptotic to the y = 0 case. For the velocity- 
mismatched case, the current falls off faster at the higher frequencies, and becomes 
asymptotic to the velocity-mismatched y =0 case. 

For the velocity mismatched case with matched input termination, y = 0, when 
LT aq > 0, the 3 dB bandwidth is given by [27] 


Ie 
fas = 2 — Oe, (4.246) 
27 (Uo — Ve) 
and for the case when open circuit input termination, y = 1, the bandwidth is 
rr 
Age (4.247) 
3 


This last result holds for velocity mismatch in the range 0 < ve/Uo < 1.47 [13]. Addi- 
tional losses occur due to the optical guide scattering loss and microwave transmission 
line loss, and carrier transit time effects, and these will change the results and hence 
the performance of the photodetector. Since the optical waveguide is in the heterostruc- 
ture form, carrier trapping may be a problem, but graded interface junctions would 
alleviate this. 
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Schematic diagram of a photo-HBT showing the depletion layers. 


Heterostructure bipolar transistor photodetector 


The bipolar transistor acts as excellent photodetector with gain. In the HBT, Figure 4.45, 
the absorption of light occurs solely in the base, and possibly in the subcollector region. 
In a double HBT in which the emitter and collector are of larger bandgap material, the 
absorption is confined to the base region, even though the base is not fully depleted. The 
usual modus operandi is to leave the base open circuit, as the light-generated carriers 
constitute the base current, though in some case to avoid carrier trapping contact is 
made to the base. The transit time of the electrons in the base of an n—p—n transistor 
determines the frequency limit of operation of the HBT, in addition to the base resistance 
effects. For a reasonable fraction of the incident to be absorbed, the base and the base— 
subcollector regions should be of the order of | tm thick, which makes these devices 
extremely slow, typically about 1 GHz bandwidth. 

According to [3], the optical gain G of the single heterojunction phototransistor is 
the ratio of the number of carriers in the collector current due to the photogenerated— 
base and subcollector carriers to the number of photons incident on the base- 
subcollector: 


hvIph— 
Colles (4.248) 
q Pine 
For a thick subcollector depletion width, which is greater than | /a, then 
G = npr, (4.249) 


where 7 is the quantum efficiency and r is optical current gain of the photo-transistor. 


Problems 


(1) A GaAs LED has a structure shown in Figure 4.1, and injection takes place from 
the n* layer into the p layer where the recombination takes place. Suppose the 
lifetime of the electrons in this layer is 10 ns, the electron mobility is 3000 cm? 
(V.s)~! and hole mobility is 300 cm? (V.s)~!, determine the thickness of the p 
layer if it is to be one diffusion length. 
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(2) 


(3) 


(4) 


(5) 


Calculate, numerically, the correct value of the external quantum efficiency of a 
LED radiating into air. Assume that the semiconductor index is 3.5, the polari- 
sation random which implies that half the radiation is perpendicular polarisation 
and the other half is parallel polarisation. Assume that the transmission coeffi- 
cient for the perpendicular polarisation from medium 1, index nj, into medium 
2, index 72, is given by 


2n 1 cos 04 


T= , 
(n1 cos 0; + n2 cos 62) 
and the transmission coefficient for parallel polarisation is given by 


2n 1 COs 02 


at (n2 cos 6, +n, cos 62)" 

A ridge laser is to operate at 800 nm, with a guide index of 3.41 and cladding 
index of 3.43. Determine the maximum thickness and width of the laser wave- 
guide. 


An InGaAsP laser operating at 1300 nm has an effective guide index of 3.4. What 
is the reflectivity of its cleaved facets? Suppose the internal loss is 30cm7!, 
what is the photon lifetime, assuming the group index is 3.5? Suppose the laser 
length is 300 \1m, width 3 j1m and thickness of 0.2 1m, the gain coefficient a 
is 2 x 107! cm*, the value of the transparency number density is 10!8 cm~? 
and carrier lifetime is | ns. What is the threshold carrier density and the thresh- 
old current? What is the relaxation oscillation frequency at twice the threshold 
current? 


A p-i-n diode receiver at 1300 nm has a preamplifier noise figure of 2 dB. The 
signalling rate is 1 GHz, bandwidth of 100 MHz, the load resistance is 100 Q, 
dark current is 1 nA, the diode quantum efficiency is 0.95 and a signal of 5 pW 
illuminates this diode. Determine the signal-to-noise ratio at the output. 


(6) Suppose the p—i—n diode is replaced by an avalanche photodetector, multiplica- 
tion factor of 10, noise factor of 3, dark current of 10 nA and background current 
of 1 nA. This APD is illuminated with | jt1W of optical power; determine the 
signal-to-noise ratio. 
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5.1 


5.2 


5.2.1 


Building blocks for high-speed 
analogue circuits 


Executive summary 


This chapter is about the use of electronic devices in elementary circuit blocks found in 
any micro- or millimetre-wave system — or in the analogue portions of fibre-optic com- 
munications systems. An introductory section describes linear two-ports on the basis of 
scattering parameters, discusses different gain definitions and treats important aspects 
of stability as well as noise in two-ports, with special emphasis on noise reduction by 
proper choice of generator impedance. 

Following this, amplifiers, oscillators and mixers are described in sequence. In the 
amplifier section, small-signal parameters are used to derive fundamental properties of 
common topologies, from the simplest, one-transistor circuits to more complex gain 
cells, such as the cascode and differential amplifiers. Tuned amplifiers are covered, 
as well as broadband amplifier techniques, including distributed amplification. Finally, 
low-noise and power amplifiers are being treated, as well as non-linearities in amplifiers. 

The oscillator section discusses how small-signal instability and non-linear gain com- 
pression effects combine to create stable sinusoidal oscillations. Important oscillator 
topologies and noise phenomena affecting the phase stability of oscillators are also 
covered. 

Mixer circuits show how specifically designed non-linear circuits provide frequency- 
translating capabilities. Mixing principles are discussed first, followed by several mixer 
topologies using field effect and bipolar transistors. 


Basic relations for two-port networks 


Scattering parameter theory 


Small signal equivalent circuits for semiconductor devices and circuits are usually repre- 
sented in two-port form as shown in Figure 5.2. At low frequencies, two-port networks 
are represented by an impedance matrix, an admittance matrix, a hybrid matrix or a 
chain matrix. These matrices are described by Z, Y, h or ABCD parameters. Such rep- 
resentations are suitable at low frequencies where the parameters may be measured 
by placing short or open circuits at the input and output terminals of the two-port. At 
high (microwave) frequencies where there are travelling waves, short and open circuits 
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Source and load circuit for the S-parameter discussion. a and b are normalised power waves. 


cannot be precisely placed and the above-mentioned matrix representations cannot be 
accurately determined. The development of the vector network analyser made it pos- 
sible to perform measurements of high frequency travelling wave circuits. Scattering 
parameters were introduced by Kurokawa [24]. He defined a set of normalised power 
waves a and b and introduced a normalising impedance Zo: 


V+Zol 
ee ee 5.1 
. 2/Zo OD 
V— Za 
aie (5.2) 


2./ Zo , 
where V and J are the voltage and the current, respectively, at a load Z. 
These normalised power waves are chosen in this way so that they relate to the power 
delivered to the load. Refer to Figure 5.1 where a source with a real source impedance 
Zo is connected to an arbitrary load Z. The maximum power is delivered to the load if 
Z = Zp = Zo. Then, 
VP Vg 
P. pe — all — Ss. = 2 
Z,max Zo 4 Zs | a | 
|a|* is hence the available power from a generator with source impedance Zp. This also 
tells us that the unit of a (and b) is VW. 
Now consider: 


Jal? — |b)? = AVP. (5.3) 
This is the power delivered to the load for arbitrary Z. We can hence interpret |a|? as 
the power travelling towards the load, and |b|? as the power travelling from the load 


back to the generator, with the difference dissipated in the load. The ratio of b to a is 
the reflection coefficient: 


: (5.4) 


because Z = Vj /J}. 

We now expand the normalised power wave concept to a two-port. The incident 
waves at each port are again designated as a and the reflected waves are designated 
as b, while the subscript denotes the port where the power waves are measured. For the 
two-port, the normalised power waves are a1, b;, a2 and b2, as shown in Figure 5.2. The 


More free ebooks : http://fast-file.blogspot.com 


Fig. 5.2 


Building blocks for high-speed analogue circuits 293 


Port 1 Port 2 
ee ——— 
ray 41 | Two-port network} XH 
SO ee ——- 
b, by 


Two-port network embedded in a transmission line with characteristic impedance Zp, with 
incident waves a1, dz and reflected waves by, bo. 


scattering parameters are the coefficients of linear equations relating the reflected waves 
b to the incident waves a: 


by = Sia, + S}2a2 (5.5) 
by = So\a) + $2242. (5.6) 
The scattering parameters can therefore be expressed as the ratio of two power waves, 


provided that all the ports are terminated in a non-reflective fashion (a = 0 at all other 
ports). For a two-port, 


b 

Siu = |ay=0 (5.7) 
ay 
b 

Si2 = —la=0 (5.8) 
a2 
b 

Sx = — ao—0 (5.9) 
ay 
b 

Sy = —|ay=0- (5.10) 
a2 


These relations can be written in matrix form as follows: 


aces a eae (5.11) 
bz S21 S22 a2 
Let Zo be the characteristic impedance of the transmission lines connected to ports 1 
and 2 of the two-port. If port 2 is terminated by Zo, there is no reflection at the load and 
hence the wave incident at port 2, a2, is zero. Similarly, if port 1 is terminated by Zo 
and the stimulus is fed to port 2, a), is zero. If port | is designated to be the input and 
port 2 the output, then $1; is the input reflection coefficient with Zo the output, S22 
is the output reflection coefficient with the input terminated by Zo, S2; is the forward 
transmission coefficient with Zo as the output load, and 52 is the reverse transmission 
coefficient with Zo at the input. 

In general, the scattering parameters are complex. The polar form of the scattering 
parameter is useful in many applications: 


S=|S|exp!?, (5.12) 


where | S | is the magnitude of S and @ is the phase. 
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Properties of scattering parameters 

The following properties of the scattering parameters are important in two-port network 
applications. Subsequently, it is assumed that the transmission line is lossless with negli- 
gible attenuation, such that the line’s complex propagation constant y = a + jB © jf. 


(i) Reciprocity. Passive networks are reciprocal (unless they contain non-reciprocal 
components like isolators or circulators), and the S-parameters satisfy 


Sik = Skj- (5.13) 
This property can be written in general form as 
[S] = [ST]. (5.14) 


It states that the matrix is equal to its transpose denoted by [S"]. 

(ii) Lossless networks. An important property of lossless networks is that the product 
of the transposed complex conjugate scattering matrix and the scattering matrix is 
equal to the unitary matrix. 


[S]Is"}* = (71, (5.15) 
where a 
[T]= 01 (5.16) 


defines the unitary matrix. 
(iii) Lossy networks. In lossy networks, the network itself dissipates power, hence 


Doha P > DEL be. (5.17) 


The scattering matrix satisfies the property 
[7] —[S]LST]* > 0. (5.18) 


(iv) Reference planes. Measurements can be made at different planes along the trans- 
mission lines connected to the two-port network; this changes the results due to 
the signals’ travelling wave nature. The reference plane is the position where the 
actual measurements are made. If the positions of the ports are shifted by electri- 
cal distances 6 away from the reference planes, the S-parameters in these shifted 
planes can be related to the initial S-parameters in the reference plane. 

If the S-parameters were measured originally at the planes z; = 0 and z2 = 0 
and if the reference planes are now at z} = ¢; and z2 = £2 as in Figure 5.3, the 
resulting S-matrix is given by 


Sip Sin] Sit exp 7% Si ep ais 
85, S5> . S24 exp J@i+ %) Soo exp 262 ’ : 
where 
A = Bey 
02 = Be. 


The expression (5.19) assumes that the transmission lines are lossless. 
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Port 1 Port 2 


Two-port network 


z=t, z=0 Zz 


Change of reference planes. 


The scattering parameters can be converted to current-voltage parameters such as 
impedance ([Z]) parameters as well as admittance ([Y]) parameters. These conversions 
are given by Gonzalez [14]. 


The Smith chart 


The analysis of two-port networks at microwave frequencies was tedious and time- 
consuming before speedy computation methods were available with computer-aided 
design software. A graphical aid to calculate various network properties such as 
impedances was developed by Smith [36, 37] and referred to as the Smith chart. The 
accuracy of results obtained from the Smith chart is quite adequate in most cases. 

When a transmission line is terminated in an arbitrary impedance Z, there are reflec- 
tions along the line, and the reflection is defined as the ratio of the voltage in the wave 
reflected from the terminating load to the voltage in the wave incident on the terminat- 
ing load. In Equation (5.4), we had already defined the reflection coefficient , which 
can be expressed as 


Z—Zo 
r= ‘ 
Z+ Zo 


(5.20) 
In most applications, it is convenient to use the value of the impedance normalised 
to the characteristic impedance or other reference impedance. The normalised value is 
given by 

Foes, (5.21) 


Equation (5.20) can now be written as 


= 
raz 


= : 5.22 
z+1 ( ) 


Both z and I’ are complex quantities and they are written in terms of their real and 
imaginary parts: 


T=u+ju (5.23) 
Z=r+ jx. (5.24) 
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z plane T plane 
Z=r+ jx T=utjv 
v 


> x 
> 


xX=+1 


+f 4 


; > 


xX=-1 


Mapping between the z plane and the I plane. 


It is often useful to express the reflection coefficient in polar coordinates. 
r=|T|el?, (5.25) 


where | I" | is the magnitude, and @ is the phase of the reflection coefficient. 

It follows that z and I are defined in two complex planes. Equation (5.22) gives 
the relationship between points in the two complex planes. The relationship is known 
as mapping. Equation (5.22) is a bilinear transformation where orthogonal lines in the 
z plane map into orthogonally intersecting circles in the I’ plane. Furthermore, it is a 
conformal mapping whereby the angle between the two line segments is maintained in 
mapping between the z and I’ planes. It is to be noted that a straight line is simply a 
degenerate circle. Figure 5.4 shows the correspondence between points in the z and [° 
planes. Expanding Equation (5.22) we have 


. r+jx—-—1 
= 5.26 
ee r+jx+1 ( ) 
Equating real and imaginary parts on both sides, 
2 2 
-—1+. 
p= iii ad (5.27) 
GP lp axe 
2x 

(5.28) 


"GE Det 


By eliminating x from Equations (5.27) and (5.28), we have 


2 2 
(sayeee(y om 
. r+1 7 r+1/ : 


By eliminating r from Equations (5.27) and (5.28), we obtain 
i? f1\* 
Ge=i1 + (v = =) - (<) ; (5.30) 
x x 


More free ebooks : http://fast-file.blogspot.com 


Fig. 5.5 


5.2.3 


Building blocks for high-speed analogue circuits 297 


Constant reactance 


Constant resistance and reactance circles in the T’ plane. 


Equation (5.29) is the equation of a family of circles with their centres at u = r/r + 1, 
v = 0 and radii equal to 1/r + 1, while Equation (5.30) is the equation of a family of 
circles with their centres at vu = 1, v = 1/x and radii equal to 1/x. Equation (5.29) 
represents constant resistance circles; each value of r > | represents a circle. Equation 
(5.30) represents constant reactance circles which are plotted for all values of z when 
Re(z) => 0. Both constant resistance and constant reactance circles are shown in Figure 
5.5. A typical Smith chart representation for practical use is shown in Figure 5.6. 

The Smith chart can also be used as an admittance chart. The constant resistance cir- 
cles become the constant conductance circles and the constant reactance circles become 
the constant susceptance circles. The bilinear transformation in this case is 


I-y 
LC =——_, (5.31) 
l+y 
The impedance and admittance representations of the Smith chart are symmetric with 
respect to the origin of the Smith chart. Because in typical impedance-matching prob- 
lems, both impedance and admittance representations are needed, they are frequently 
plotted in the same diagram (see Figure 5.7). 


Impedance matching 


To maximise gain in an amplifier, its input and output impedances must be chosen to be 
the complex conjugate of the generator and load impedances, respectively. To achieve 
this, impedance-matching networks are connected at the input and the output of the 
amplifier, which convert the true generator and load impedances (frequently 50 2) to 
the necessary values. Figure 5.8 shows the block diagram of the complete network. 
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Aside from the complex conjugate match, also referred to as power match, impedance 
transformation may be necessary to achieve minimum noise figure, or maximum output 


power. Detailed discussions of these techniques (with diverging goals) are given later in 
this chapter. 


5.2.4 Power gains for amplifier design 


The power gain in microwave circuits is expressed in terms of the scattering parameters 
for convenience in performing calculations using network analyser measurements. 


More free ebooks : http://fast-file.blogspot.com 


Fig. 5.7 


Fig. 5.8 


Building blocks for high-speed analogue circuits 299 


COO 

SYS {) 

KROES —j2.0 
an 


1.0 


Matching 
network 


Two- __» | Matching 
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Block diagram of a two-port embedded in impedance matching networks. 


Input and output reflection coefficients for arbitrary terminations 
The two-port network in Figure 5.9 is now assumed to be an amplifier circuit. We will 
first calculate the input and output reflection coefficients T'jn and Mout, respectively, for 
arbitrary source and load reflection coefficients. 

If Zy is the load impedance in a transmission line system of characteristic impedance 
Zo, the reflection coefficients at the source and load (Figure 5.9) are given by 
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Ze S a by 
Amplifier S Z 
“Oe, ‘= 
i a 
T's | | Tin | Tr 
Amplifier representation. 
Zs — Zo 
ls = >—— (5.32) 
°~ Zs + Zo 
ZL — Zo 
ry = =——.. 5.33 
a (5.33) 
Writing Equation (5.11) in expanded form, we have 
by = Sia, + S}2a2 (5.34) 
bz = Soa) + $22a2. (5.35) 


It is evident from Figure 5.9 that reflection coefficients are related to the incident and 
reflected waves by the following equations: 


b 

ee (5.36) 
ay 

az =T bo. (5.37) 


By substitution into Equation (5.11), it can be shown that 


Si2Sai TL 
Tin =S ———_—, 5.38 
ie St SoTL (5.38) 
The output reflection coefficient is defined as 
bo 
Tout = — lvs=0 - (5.39) 
a 
By substitution in Equation (5.11), it can be also shown that 
S12S21T's 
Vout = Soo + ————.. 5.40 
out 22 l= Sts ( ) 


Powers at input and load 
When designing amplifiers, different definitions of power gain are applied depending 
on the application. It is necessary to define the power at the input and the output of the 
amplifier in order to obtain the gain. 

The power delivered to the input port of the amplifier is written in terms of the inci- 
dent and reflected waves at the input. The reflected wave can be expressed in terms of 
the incident wave and the reflection coefficient, see Equation (5.36). 
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Pin = [a1 ? — |b? (5.41) 
=!a1 ? (1-1 Tin P). (5.42) 
Similarly, the power delivered to the load Z, is 
PL =| by |? — | a2 |? (5.43) 
=lb2? (1-1TLP). (5.44) 


The power travelling towards the load is partly due to the wave originated by 
the generator electromotive force bs, and partly by the reflection from the source 
impedance I's: 


a, =bs+T sh (5.45) 
with 
VsJ/Z 
pea" (5.46) 
Zs + Zo 
Zs — Zo 
Ts = ——_.. 5.47 
Zs + Zo oe 


Detailed analyses are given by Gonzalez [14]. 
The power available from the source is labelled Pays and is defined as 
ees ae P (5.48) 
* T= | Te? 
= jg, 2 a Esti? 
" 1=(PsP 
Note that Pays is the input power under conjugate match, i.e. when Tin = r. 


The power available from the network Payn is defined as the power delivered by the 
network when the load is conjugately matched to the output impedance, or 


|S21|7|bs |? 


n= : 
wm [1 = SuPs|2(1 = [Poutl2) 


(5.49) 


Power gain definitions 

Let us now discuss the power gain definitions which are important in amplifier design. 
The operating power gain Gp is the ratio of the power delivered to the load to the power 
delivered to the input of the amplifier: 


PL 
oe s 
_ jbo)? 1-1 PL iP 
~ far] I= | Tin [2 
Since 
by S21 


ay T= TS’ 
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and using Equation (5.38), we obtain 


1- (PL? 
|1 — PL S22|? — [S11 — TLA(S)|?’ 


Gp = |So1|” (5.50) 
where A(S) is the determinant of the scattering matrix. 

The transducer power gain Gy is the ratio of the power delivered to the load to the 
power available from the source: 


PL 
Gr= 
Eo Ps, 
_ |b |? d= IPL?)d = IPsl?) 
ay [1 —TinT's|? 
1—|T|Dd — |Psl? 
~ 5,72 (= (PL) = Psi?) esp 


[1 —TpS22 — Ts (Sty — PLACS)) |?” 


In the absence of deliberate feedback, the reverse transmission Sj2 is frequently very 
small and can be neglected. $;2 = 0 simplifies the denominator in Equation (5.51) and 
we obtain the unilateral transducer power gain 


(1 —|FLI?)d — |Ps|?) 
| — SyTL)C — Sx2Ps)|?° 


Gru = |Sou/* (5.52) 


The available power gain Gq, finally, is the ratio of the available gain from the network 
Payn to the available power from the source Pays: 


Payn 
Ga = — (5.53) 
Pays 
2 1—|Fs\* 
= | S21 | 2 2 
[1 — SypVs|*C = |Poutl*) 
= |Sai? perl 


[1 — SyT's|? — [S22(1 — Sis) + S12S2iTs|?’ 
using Equation (5.40). 


Stability 


Ga still contains I's as a variable. We know already that the maximum power transfer 
from the source to the load occurs for 's = I}, so this appears to be an optimum 
choice. Before we proceed, however, let us again investigate Equation (5.53). If the 
denominator becomes zero, the available gain would grow beyond all bounds. This 
happens if 


[1 — SyiVs| = [S220 — SyiPs) + Sy2S10 sl, 


or, written differently, 


=l= IT outl, 
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using Equation (5.40). Likewise, we can show from the available gain in the reverse 
direction that it would grow beyond all bounds for |T'in| = 1. 

Both these conditions are considered as the instability of the amplifier, a potentially 
dangerous situation which may lead to malfunction or even fatal failure. In most cases 
(the notable exception are oscillators), it needs to be avoided. 


Unconditional stability 
From the above, we can deduce that a two-port will be unconditionally stable if 


S12S21 PL 
S 5.54 
uct 1 Sul (5.54) 
for all |L| < 1, and 
S12S21TG 
S —— 1, 5.55 
| 22 + ia sate < (5.55) 


for all |G| < 1. 


Stability circles 

The following discussion follows Hoffmann [20]. For conditionally stable two-ports — 
where at least one of the conditions in Equations (5.54) and (5.55) is violated — we can 
still find generator and load admittances which allow stable operation. If we plot the 
locus of 

S12S2iTG | _ 1 

1—S$iiVG 

in the complex Ig plane, we obtain a circle with centre vector 


A(S)* S22 — Shi 


S22 + 


lec = —— >: 5.56 
S¢ = TAF SnP 2 
The radius is 
S12 8 
ios | - 21 =: (5.57) 
ACS)? = Sui? | 
Likewise, plotting the locus of 
S r 
Sut 12S210L | _ 1 
1 — SoTL 
in the complex I’, plane, we obtain a circle with centre vector 
A(S)*S11 — S55 
The = woe? (5.58) 
|A(S)|? — |S22I? 
and radius 
S12 S 
|Si2 Soi (5.59) 


"6 * TAG)2 — [Sal] 


Examples for stability circles in the generator and load planes are shown in Figure 5.10. 
The stability circles provide the boundaries between the stable and the unstable regions; 
however, we still need to determine whether the inside or the outside of the circle is 
stable. Let us do this for the Ig plane first. 
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(a) (b) 


Stability circle examples: (a) in the g plane and (b) in the ly, plane. 


Using Equation (5.40), we conclude that ou:G = 0) = S22. Now we locate the 
area which contains Ig = 0 (the centre of the Smith chart). We can say: 


e If |S22| < 1, then the region which contains g = 0 is the stable region. 
e Otherwise, if |S22| > 1, the region containing [g = 0 is the unstable region. 


The same procedure applies for finding the stable region in the I’, plane. 
Using the stability circles, we can look at stability in a different way. A two-port will 
be unconditionally stable, if all of the following conditions are fulfilled: 


@ [Si] <1 

(ii) [S22] < 1 
Gi) |i.) > l+ri 
(iv) [TG.cl > 1+1re. 


In this case, both stability circles are fully outside of the Smith chart unity circles in the 
reflection coefficient plane (which contain all |G], |_| < 1), and the outer regions of 
the circles are the stable ones. 


Rollet’s stability factor 


Conditions (iii) and (iv) above can be combined into the following: 


k>1+max|0 JAS) = 1 (5.60) 
" 2Syol|Sail |’ ; 
where 
1 — |Si1|% — |S22|2 + JAGS)|? 
—_ [Sii|- — |S22|- + |ACS)| (5.61) 


2|S12|| S211 


k is the Rollet factor. k > 1 is often used as a stability criterion; however, note that it is 
a necessary, but not sufficient requirement for unconditional stability. 
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A very common, and potentially fatal, mistake is to assess stability only for the 
intended frequency of operation. It must be investigated over the full frequency range 
where instability may conceivably occur — parasitic oscillations at very low frequencies 
are extremely common, and instabilities may also increase with increasing frequencies 
such as in cascode amplifiers (see p. 333). 


Maximum available gain and maximum stable gain 


Let us now reassess the case of an amplifier with simultaneous complex conjugate 
match at the input and output ports, ie. Ts = I%,%L = WOut- The available gain 
in this case is the maximum available gain and can be written using the Rollet factor 


Equation (5.61) as 
S21 


Si2 


MAG = (k a FS 1) (5.62) 


Obviously, a real solution exists only if k > 1. 

For k < 1, the maximum stable gain is frequently quoted. This is the MAG in the 
limit of k = 1, which can be obtained by the resistive loading of an otherwise not 
conditionally stable amplifier: 


S21 


MSG = 
Si2 


: (5.63) 


Mason’s unilateral gain 


The concept of unilateral gain for a two-port network was first introduced by Mason in 
his paper [28], which is now considered a classic. A comprehensive review of the paper 
and its relevance today is given by Gupta [15]. 

The unilateral gain is defined as the maximum power gain obtained by a two-port 
when it is made unilateral — unilateralised. A two-port network that includes an active 
device is made unilateral by a lossless and reciprocal four-port network connected to 
input and output of the two-port under investigation. This network provides the nec- 
essary feedback to impose the unilateral condition. Mason’s unilateral gain is not to 
be confused with the unilateral transducer power gain Gry, Equation (5.52), which 
had been derived by neglecting the reverse transmission. Here, reverse transmission is 
eliminated by a unilateralising network. 

The unilateral gain is a figure of merit which is intrinsic to the device and hence 
independent of the circuit in which the device is placed. The unilateral gain is, therefore, 
an invariant property of the device. Hence, it can be expressed in terms of the device’s 
small-signal parameters. The scattering parameter representation of U is the most useful 
in microwave applications. Ku [23] has given an expression in terms of the S-matrix: 


| Siz — Sai |? 


v= ; 5.64 
A((1] — [S*]™[S]) oe 
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where [J] is the identity matrix. U can also be expressed with the help of the stability 
factor k [15]: 


(5.65) 


Maximum frequency of oscillation 


The maximum frequency of oscillation is a criterion for a device’s ability to amplify 
power. Because the maximum power gain at any frequency is obtained by conjugately 
matching the input and output ports, the MAG (see Equation (5.62)) is an obvious 
choice. The maximum frequency of oscillation fimax is then the frequency where MAG 
drops to unity: 


MAG( fax) = 1. (5.66) 


However, we had seen that MAG only exists when Rollet’s stability factor k > 1. This 
raises a practical problem — for many high-performance microwave transistors, k < 1 in 
the whole measurement range, and so fmax cannot be determined using Equation (5.66). 

Another customary definition therefore makes use of Mason’s unilateral gain U (see 
Equation (5.65)), which does not have this limitation. fmax is then understood as the 
frequency where U drops to unity: 


U(fmax) = 1. (5.67) 


Note, however, that Equations (5.66) and (5.67) will generally not yield the same result, 
so it is important to check the definition used when comparing fmax values. 


Noise in two-ports 


Noise phenomena 


Any electronic component exhibits electronic noise, provided that the absolute tempera- 
ture is T > 0. There are several physical origins, which have been discussed in Chapter 
2 in the context of active devices, so only a brief summary shall be given here. 

Noise occurs in all contexts where carrier motion or carrier density is stochastic: 


As there is always a random thermal motion superimposed on any charge carrier 
movement, thermal or Johnson noise is omnipresent in all conductors with non-zero 
resistance. The rms voltage generated by the thermal noise in a resistor of value R 
is Vins = V4kT RB, where k is Boltzmann’s constant and B is the measurement 
bandwidth. 

Emission of charge carriers over energy barriers is equally a stochastic process, and 


its associated noise mechanism is called shot noise. The rms current generated by a 
current of magnitude J flowing over an energy barrier is ims = /2q/B, where gq is 
the elementary charge and B again the measurement bandwidth. 
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= 


Thévenin equivalent circuit of a noise resistor terminated by a load of equal value. 


e Examples of noise due to random fluctuations in carrier density is generation- 
recombination noise or noise due to trapping and de-trapping processes. Unlike 
the first two noise phenomena, which can be considered to have a spectral density 
independent of frequency (‘white noise’), these processes produce noise spectra with 
low-pass behaviour, and cutoff frequencies in the Hz—MHz range. 

e Another example of noise generated by random changes in carrier densities is 
avalanche noise, due to carrier multiplication effects in high-field regions. This pro- 
cess also produces a low-pass limited noise spectrum with a cutoff frequency in the 
GHz range. 


For the discussion of noise in linear two-ports, however, the physical origin of noise 
is irrelevant. In fact, we will occasionally assume in the following that noise is always 
thermal in nature. Thermal noise has an interesting property. Consider that the squared 
magnitude of the noise voltage phasor generated in a resistor R in a bandwidth B is 


(IvnI?) = 8kT RB, (5.68) 


which corresponds to the rms value of ’4kT RB mentioned above. The Thévenin equiv- 
alent circuit of the noise resistor can then be drawn as in the box of Figure 5.11. The 
source is terminated by the same resistance R, so that the available power is delivered. 
As power is related to the peak voltage as 0.5V2 /R, and the voltage drop across the 
resistor iS U,/2, the resulting available power due to the thermal noise in resistor R in a 
bandwidth B is 


(|val?) 


N= 
8R 


=kTB. (5.69) 


The available noise power of a resistor is hence independent of the resistor value and 
depends only on the absolute temperature T and the measurement bandwidth B. 


Noise figure 


We will now, in a general form, consider what happens to a signal as it traverses a 
noisy two-port. Figure 5.12 shows an arrangement where the two-port is connected to 
a generator (source) with generator resistance Rc and a load Ry... For now, we assume 
that we deal only with real impedances and that both generator and load constitute a 
power match, i.e. Rin = Ro, Rout = Rv. The condition of power match at the input will 
be dropped further down. 
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Generic noisy two-port connected to source and load. 


We further assume that the noise at the input is only due to the thermal noise of the 
generator resistance, and that this resistor is at a temperature of Ty = 290 K.! The noise 
power at the input port is then 


N, =kToB. (5.70) 


In a | Hz bandwidth, this amounts to 4 - 1077! W or —174 dBm.” 

The two-port will also contribute noise. To simplify things, let us assume that all 
noise sources inside the two-port combine into a single noise source with power Neg, 
also located at port 1. Because the two-port has a gain of G, the noise power at the 
output is Np = G - (Nj + Neg). 

If there is an additional signal component S; present at the input, it equally is magni- 
fied by G. Sy = G - S;. We can now define the ratio of the signal-to-noise ratios at the 
input and the output: 


S2/Nr NiGS Ni 


Si/N1 _— SiG(Ni + Neq) _ , , Neq _ F (5.71) 


This defines the noise figure F of the two-port as the ratio of the signal-to-noise ratios at 
the input and the output, provided that the input carries thermal noise at a temperature 
To = 290 K only. Note that F > 1 under all circumstances. 

We also found the relationship between the equivalent noise power at the input and 
the noise figure: 


Neg = (F — IM = (F — I)RTOB. (5.72) 


If we assume that Neg is also thermally generated, Neg = kT,B, we can define an 
equivalent noise temperature for the two-port: 


=F =i, (5.73) 


Both F and T, can be used interchangeably to characterise the noise performance of 
two-ports. The use of 7, is customary in cases where F is very close to 1, for example 
in low-noise amplifiers (LNAs) for satellite applications, while F is more popular for 
general applications. 


' 290 K is the standard temperature for noise calculations. 
2 1 dBm is one decibel relative to a power of 1 mW. 
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Cascaded noisy two-ports. 


Noise figure of cascaded two-ports 
The situation depicted in Figure 5.12 is too simplistic for practical applications, because 
amplifiers or receivers generally consist of several stages. Let us consider next what 
happens when several noisy two-ports are being cascaded. This is shown in Figure 5.13. 
Again we assume that all ports are power-matched, which is an important restriction, 
but is made here to simplify calculations. 

First, we calculate the noise at the output (delivered to the load RL), assuming that 
for each two-port, the noise sources are combined into equivalent noise sources at its 
input. Then, 


No = G3 { Neg,3 + G2 [Neq,2 + Gi (Neq.1 + N1)]}. (5.74) 


Now we assume that all noise sources are being transferred to the input of the cascade. 

The equivalent noise source there will have a noise power of 

N eq,2 N eq,3 
G| GiG2 


Neg, tot _ Neg, 1 ae (5.75) 


Recalling Equation (5.72), we finally calculate the noise figure of the cascaded two- 
ports: 


Neg, tot Fy-1 Fj-1 
Fiop = 1+ —— = Fi t+ + — 4 5.76 
tot M1 1 Gi GiGs ( ) 
In a more generalised form, the total noise figure of a cascade of n stages is 
n EF. 1 
Ne 

Fot = Fi + 2 = (5.77) 

i=2 II Gx 

k=1 


This is the famed Friis equation [11] which in effect postulates that in a receiver 
chain the overall noise figure is approximately the noise figure of the first stage, pro- 
vided that that stage has sufficient gain — an important observation for the design 
of LNAs. 

A consequence of the Friis formula is that in any amplifier, a low noise figure without 
sufficient gain is meaningless as the noise of the following stages will take over. It 
is therefore useful to combine noise figure and gain in a single figure of merit. This is 
the noise measure according to Haus and Adler [19]. When using noise figures, it is 
written as 

F-1 


i= — 
1=VG 


(5.78) 
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where F is the noise figure and G the gain. The noise figure of an infinite chain of 
identical transistors is then 


Fo =M +1. (5.79) 


The noise measure can also be written in the form of a noise temperature: 


Th 


MeL _. 
eS T= 1/6 


(5.80) 


where Ty is the equivalent noise temperature of the two-port (see Equation (5.73)). 


Noise figure with arbitrary generator admittance 


We will now abandon the condition that all ports are power-matched by allowing arbi- 
trary terminations for the input port. The source admittance Y g, is expressed by its real 
and imaginary parts, Y,; = Gs + j Bs. As before, we assume that all noise sources 
internal to the two-port can be combined at the input. Due to the arbitrary generator 
admittance, we now have to split the noise sources into a noise voltage source v and a 
noise current source 7. This is shown in Figure 5.14. 

The equivalent noise voltage and current sources, v and i, are partially correlated. 
This can be accounted for by introducing a correlation admittance Y ¢ and splitting the 
current source into an uncorrelated part iy and a fully correlated part Y qv: 


i=iyt+Yov. (5.81) 


The real part of the generator admittance contributes a thermal noise current with the 
phasor: 


lir|?) = 8kToBGs, (5.82) 
(in?) 


where 7) = 290 K is the standard temperature and B the measurement bandwidth. 
Note that in Figure 5.14(b), all sources share the same source admittance Ys, so that 
it is sufficient to calculate v; for the case i; = 0: 


iti 
Nee peas (5.83) 
Ys 
Equivalent noise 
two-port 
- & 
—_ |__ 
_ GY y. DY - Noiseless 
is A * t : two-port 
‘Z oe | = 
(a) (b) 


(a) Noisy two-port with arbitrary generator admittance and (b) its equivalent circuit, introducing 
the equivalent noise voltage v and the equivalent noise current i. 
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We are really only interested in the powers, N « (Ivy I"): 


(ivi?) = (vj v7) (5.84) 


fo (li?) + (liv?) Giv*)— (i*v) 
> (el )+ IY s|? Y¢g Y% 


— (Elo (pl i4 Be (Ee) 


The noise figure can be expressed as 


N1, total vi |? 
= Miso oP ‘sak 
Miir (lit I?)/I¥ sI 
where JN} total 18 the noise power due to all noise sources and Nj j, the noise power due 
to the generator thermal noise at T = Tp alone. 


Combining Equations (5.84) and (5.85) yields 
(i?) | (vl?) | Yc (=) | 
F=14+-— +— Yolo} 1+ ==4+ (== : (5.86) 
(liri?) © (lin?) Ys \Ys 


Equation (5.86) can be written in a simpler form by introducing 


e an equivalent noise admittance 


8kToB 
e an equivalent noise resistance 
(lel?) 
Ry = 
8kToB 
e and remembering that 
(lérl?) 
Gs = 
8kToB 
from Equation (5.82). 
This results in 
Bi Kai 2 Yo, (%c\" 
F=1l+—+—|Y 1+—~+ |= 3 5.87 
Gs Gs es re \s wick 


The noise figure depends therefore on the generator admittance Yy = Gg + jYg. This 
was first pointed out by Rothe and Dahlke in 1955 [31, 32]. 
We can now search for the generator admittance where F will be minimal. This 


results in 
_ [Sn 2 
Gs opt = — Be (5.88) 
Ry 


Bs opt = —Be, 


where Bc is the imaginary part of the correlation admittance. 
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Introducing these terms into Equation (5.87), the noise figure can be written as 
follows: 


R 
F = 1+2Rn(Gsop + Go) + G* [(Gs — Gs.op)? + (BS + Bs.op)”]- 6:89) 


For Ys = Ys_opt, the minimum noise figure is 


Fin = 1+ 2Rn(Gs opt +Gc)=1+2Rn (cc + 4/ > = Be ’ (5.90) 
n 


and therefore finally 


Rn 
F = Foin + —— 


Ge [(Gs — Gs.op)” + (Bs ~ Bs.op0)"] (5.91) 


2 


Rn 
= Fmin + Ge Ys ~~ Ys opt 
To describe the noise performance of a two-port, we need the following parameters: 


e the minimum noise figure Finin, 
e the equivalent noise resistance Rp, 
e the noise-optimised generator admittance Y g 4p. 


For microwave applications, it is more customary to work with reflection coefficients 
instead of impedances or admittances. For this, we introduce a normalising admittance 
Yo, typically 20 mS, and a normalised noise resistance 7p) = Ry» Yo. Then Equation (5.91) 
turns into 


4rn [Ts = Feil 


F = Fain + : 5 (5.92) 
(1 =|Ps| ) [1 + T's opx| 
where 
_ Yo-Ys 
7 o+ Ys 
Plotting F = f(I's) on a Smith chart, the contours of constant F are circles. An 


example is shown in Figure 5.15. 


Associated gain 

The condition for noise-optimised source reflection coefficient 's = Ig opt is inde- 
pendent of the power matching conditions derived earlier, sy = I'|. The maximum 
gain under ‘noise matching’ conditions is therefore smaller than the MAG, which 
would be achieved when input and output of the two-port are conjugately matched. 
Here, 's = Is opt and the gain for the conjugately matched output becomes (see 
Equation (5.53)) 
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Example for ‘noise circles’ — contours of constant noise figures on the Smith chart. 
1=| V's opt |? a 1 
Sort | Sot? ———5 
| 1 — SiiV's opt | 1— | Pout | 
9 2 
\So1? (1 = |Ps.optl") 


= 2 2? 
[1 — SiiM's,opt|" — [S221 — S11T's,opt) + 512S21T's,opt| 


Gass = (5.93) 


since 
S21 S12T's opt 


Vout = $22 + —————.. 
i 1 — SiiVs,opt 


The available gain under noise matching conditions is called associated gain. 


Transistor amplifiers 


A brief historical discourse 


Amplifiers are such an integral part of any wireless communication system that we have 
to explicitly recall that in the first decades of radio, they were not used at all. It was not 
before the invention of the electron tube triode that amplification of alternating current 
signals became possible. Lee De Forest’s Audion tube (conceived in 1906 and patented 
in 1908 [10]) is probably the first example of an amplifying device. It was 24-year- 
old Edwin Armstrong, the prolific and finally tragic inventor of radio’s early days, who 
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De Forest’s three-terminal audion (from US Patent No. 879,582). 


actually explained its operation in 1914. As Figure 5.16 shows, the De Forest Audion 
already has the common three-terminal arrangement which we also find in transistor 
amplifiers. De Forest’s claim to have invented the tube concept independently of the 
‘thermionic valve’ patented by John Ambrose Fleming in 1905 [9] is doubtful, however. 


Critical amplifier parameters vary with respect to where in a system it is being used, 
and what kinds of signals are being fed through the amplifiers. For example: 


e When dealing with very low-level signals, for example immediately after the receiving 
antenna, or behind the optoelectronic converter in a fibre communication system, it 
is of paramount importance that the amplifier itself adds as little electronic noise as 
possible. 

At the output of a wireless transmitter, an amplifier should produce the required RF 
power with optimum efficiency and/or the required linearity, which in turn depends 
on the modulation format used. 

Other systems, such as in fibre-optic communications or ultra-wideband wireless sys- 
tems, require extreme bandwidths, with little variation of gain and group velocity 
between the lower and upper cutoff frequencies. 


These requirements can rarely be met simultaneously, so trade-offs have to be made 
which require a thorough study of the system specifications before the amplifier design 
is begun. 


Fundamental amplifier configurations 


Amplifiers will be discussed initially at a certain level of abstraction in order to make 
clear that the fundamental methods apply to FETs and bipolar transistors alike. The use 
of general methods will be emphasised rather than introducing a large number of circuit 
topologies. To this end, we will first introduce a generalised equivalent circuit for an 
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Generic three-terminal amplifying device. 


amplifying device, and then calculate its small-signal parameters — the y matrix in this 
case. Based on the y matrix, we develop general expressions for important amplifier 
parameters — the input and output admittances, and voltage and current gain. For differ- 
ent circuit topologies, we then calculate modified y matrices, which will immediately 
yield the amplifier parameters. 

The move from abstract concept to actual circuit implementation will also be made 
occasionally. You will see that the elements of real devices can be readily matched to 
the general equivalent circuit. 

The first important observation in amplifier design is that three-terminal devices can 
be used in three fundamentally different ways. Let us consider the generic three-terminal 
device depicted in Figure 5.17. 

We restrict our discussion to linear behaviour for the time being — corresponding to 
the small-signal case. The amplifying action is due to the voltage-controlled current 
source between nodes 2 and 0. The controlling entity is the input voltage v19 and the 
parameter is the transconductance g Pe which is generally taken as a complex value. 
This allows us to include additional phase delays in g Re 


8a gmoe 2°7 (5.94) 


Additionally, we included complex impedances Y 19, Y 5) and Y 5. These can later 
be correlated with the parameters of FET or bipolar transistor small-signal equivalent 
circuits introduced in Chapter 2. 


y matrix representation 
For the ‘hybrid zr’ equivalent circuit in Figure 5.17, the y matrix is easily calculated and 
will be used here. The advantage is that general amplifier properties, such as voltage and 
power gain or input and output admittance with arbitrary terminations, can be calculated 
once and can then be easily applied to the different topologies we will consider. 

The y matrix expresses the following system of linear equations with respect to the 
two-port shown in Figure 5.18: 


iy = yup + yy2 v2 (5.95) 
i2 = y21 vi + y22 v2 (5.96) 
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A two-port in y matrix representation terminated with generator impedance Yq and load 
impedance Y,. 


The y matrix [Y] can also be calculated from the scattering matrix [S] introduced earlier: 
[Y] = Yo- (11) — [8] (1) + ts)! (5.97) 


where [1] is the identity matrix and Yo the normalising admittance used in the 
calculation of the scattering matrix — usually 20 mS. 

First, consider the input admittance Y,. The output port is terminated by an 
admittance Y;. 


iy 


a 
VI] 
i7=—-v2Y). 
We find 
Y12 Y21 
Y, = yW1 -— ———— _.. (5.98) 
Yu + y22 


Likewise, the output admittance when the input is terminated with an arbitrary 
admittance Yq is 


¥12 Y21 
Y2 = y22 — ———_. (5.99) 
Yo+yu 
The forward voltage gain Ay = v2/v1 is 
—y21 
Ay = =——_. (5.100) 
YP ye 
And finally the current gain Ay = i2/i1: 
Y 
Ap aie (5.101) 


yir(v22 + ¥p) — yo viz 


The frequently used short-circuit current gain can be easily calculated from Equation 
(5.101) when Y;, > ov: 

Yat 
Yu 


n= 7 |(vy = 0) = (5.102) 


Common source/common emitter: node 0 as the common node 
The most obvious connection is to ground node 0. In FETs, this configuration is called 
common source; in bipolar transistors, it is called common-emitter configuration. 
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Generic amplifier configuration with node 0 grounded. 


The y matrix can be easily calculated (see Figure 5.19): 


yu =YiotLYy2 (5.103) 
yi2=—-Y yo (5.104) 
yu =8,—~ %12 (5.105) 
y22 = Yoo + Vp. (5.106) 


Let us first investigate the voltage gain using Equation (5.100): 


—Y 
tees vt m7 212 


& 
Yut y22 Yp+Yoot¥i 


(5.107) 


To facilitate interpretation, let us assume that Can is real, and that the feedback 
admittance is weak: gm — Yj © 8m, Yy t+ Y og + Yo © Vy + Yy9- Then, 


&m 


Ay * -————_.. 
Yut+Yo0 


For low frequencies, Y,; and Y 59 are real, and we find: 


e The magnitude of voltage gain in common-source and common-emitter stages will be 
approximately equal to the product of transconductance and effective load resistance 
(the parallel connection of external load resistance and device output resistance). 

e The output voltage lags the input voltage by 180°. 


A useful figure of merit is the maximum voltage gain a three-terminal device can pro- 
duce with node 0 grounded. We find it from Equation (5.107) by choosing Y, = 0 and 
assuming Y j7 < ¥ 99,8! 


g 
Av,max © ae (5.108) 
£20) 


Now let us take a look at the input admittance, using Equation (5.98): 


yi2 Y21 8m —Yy9 
Yy=yi- Set Hy ty (1+ #52 _), 
YL + y22 o - YutYotXi2 


or recognising from Equation (5.107) that the last term in parentheses is Ay: 


Y =Yigt Yi. — Av). (5.109) 
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Simplified hybrid z equivalent circuit of a FET with load admittance. 


Miller effect 

It is now time for a small practical example. In a FET, the small-signal equivalent circuit 
looks like Figure 5.20, if we neglect the series resistances. From comparison with Figure 
5.17, we recognise that 


Y 19 = JoCGs 
Y j2 = joCop 
Y¥ 49 = gps. 
The voltage gain is now 
8m—JeCacd &m 


Ay — nN ; 
YL + gps + s@Cop YL + gps 


for low frequencies. 
The input admittance is, using Equation (5.109), 


Y, = j@Cos + swCop (1 — Ay). (5.110) 


The feedback capacitance Cgp appears in parallel with Cgs, but is multiplied by the 

magnitude of the voltage gain, augmented by one — this is the dreaded Miller Effect, 

which we always have to be aware of in high-speed circuit design, because it may 

significantly increase the input capacitance. It was described as early as 1920 [29]. 
Going back to Figure 5.19, we calculate the output admittance to be 


Yo=Yy+Y (1+ fa} (5.111) 
ae Te YoetY¥iot¥i2/) , 


The term 
Sm— Fi A 
Yo + Yio + Yi2 

can be significantly larger than 1 and may act like a ‘reverse Miller Effect’. This has to 
be taken into account if a tuned circuit is connected to the output node, as is the case 
in typical tuned amplifiers. Figure 5.21 shows an example — the detuning effect of the 
feedback capacitance will be much larger than expected. 

The current gain of the common-source/common-emitter amplifier, finally, becomes 


A ieee SP as 
[|= . 
FeO. t+2., + Lio + Yao) + Yio @1 + Yap) 


(5.112) 


We can safely assume here that the real parts of all admittances are larger than zero — 
then the current gain in the quasi-static limit (f — 0) is positive. 
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Common-source amplifier stage with tuned load, and the output-referred Miller capacitance. 


Let us check the latter equation again with our FET equivalent circuit by calculating 
the short-circuit current gain: 


&m — J@CGD ~  §&m 


ho, = Ai|(Yy, > we) = x 
EL Jo(Cgs+Cop)  soCcs 


assuming that gm >> wCgp and Cgs >> Cop. 

This is the equation we had earlier used in Chapter 2 to estimate the transit frequency 
of a FET to be fr = gm/(27 Cos). 

Let us summarise our findings for the amplifier configuration with node 0 grounded — 
applicable to both the FET common-source and the bipolar common-emitter topologies: 


The configuration can provide substantial voltage and current gains. 

The voltage gain provides a phase shift of 180° in the quasi-static limit, whereas the 
current gain experiences no phase shift. 

Due to the presence of feedback, the input and output admittances always depend on 
the termination of the opposite port. 

For substantial voltage gains, the Miller effect has to be observed which can 
substantially increase the input capacitance. 


Common gate/common base: node 1 as the common node 

In the next set-up to be discussed, node | is grounded, and the input current is fed into 
node 0. Node 2 is still the output node. This applies to the FET common-gate and the 
bipolar common-base configurations. Again, we will first calculate the y matrix in a 


general form. For this, it is valuable to recognise in Figure 5.22 that vj = —vjo. 
vir = Esp Lio +e, (5.113) 
y12 = —Y¥ 4 (5.114) 
y21 ==Yo- 2s (5.115) 
y22 = Y 0 + Yp- (5.116) 
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Hybrid z equivalent circuit with node | grounded. 


This input admittance is then 

tut ia 
Yi Yao + Lia 
Let us simplify this expression somewhat. First, realise that in practical devices nec- 
essarily Be doe: otherwise it would not have a reasonable voltage gain in 


common-source or common-emitter configuration (see above). Further, let us assume 
that 4 12 <K Y,. Then, 


Y) =Viot t+ Ya) (5.117) 


a ee a 
il Em¥, +¥ 09° 


An interesting observation is that now Y 59 is the feedback admittance which determines 
the sensitivity of the input admittance on the load. If further, this admittance is very 
small, Y 59 < Y,, then 

My Fig t Sa 


Using the example of Figure 5.20, we find 


wCgs ao) 
ris (14) gg (1452) 
8m WT 


The input admittance is hence approximately the transconductance, unless we are oper- 
ating close to fr. In practical transistors, this will be a quite large value — much larger 
than the input admittance in the topology with node 0 grounded. 
The output admittance is 
cae.) ee : 
Fo t+ E10 2207 85 


Y= (5.118) 


Assuming again that g__ > Y 29, 


Yn 

§ 
1 
YotNYio 


Y2 ¥ Yio + 


Compared to the topology with node 0 grounded (common emitter or common source), 
the output admittance will be substantially smaller. 
The voltage gain is calculated as 


Age (5.119) 


More free ebooks : http://fast-file.blogspot.com 


Fig. 5.23 


Building blocks for high-speed analogue circuits 321 


Assuming again that g ae Y 59, this simplifies to 
& 


2m . 
Yi +YotY¥i 
Compare with Equation (5.107) — this is the same result, if Y }» < gm, which is a safe 
assumption. 


Finally, the current gain is 


Ay = 


TEE sn 
Yut¥i)YotViot gs) +¥20¥10° 


Ars (5.120) 


The magnitude of A; for this topology is hence less than 1. For greater simplicity, 
calculate the short-circuit current gain (YL —> 00): 

Fee 
Yoo + &,, + Y10 

1 


ho, = 


14 E10 
gm 
Using again the simple FET equivalent circuit in Figure 5.20, this reduces to 


1 
hz) = — 


ae 
Leg 

OT 
In other words, the short-circuit current gain of the common-gate and common-base 
configurations will remain independent of frequency until quite close to fr. 


Common drain/common collector: node 2 as the common node 
The last fundamental configuration of the generic amplifying three-terminal device 
(Figure 5.17) has node 2 as the common node (see Figure 5.23). In FETs, this will 
be called common drain; in bipolar transistors, this will be called common-collector 
configuration. 

Again, first calculate the y matrix of this configuration. 


Yili =VYiotYp (5.121) 
yi2=—-Y jo (5.122) 
yn =—-Vio + £,) (5.123) 
y22 =YiotYorts,: (5.124) 

iy 1 Ya 0 

> oO _ i 

ry <2 

Yio ro Im Vi0 v Y20 
ms QO Vy 
2 
Oo -O 


Generic amplifier configuration with node 2 grounded. 
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Let us first consider the voltage gain Ay, when port 2 is terminated with a load 
admittance Y{. 


=i 
Ay = ya Fig Se = 14 Eu + E20 o: 45,125) 
Y+y2 Y¥pt+YiotYorts,, Yiot8,, 


In practical devices, the second term in parentheses will be small compared to 1, at least 
at lower frequencies, so that Ay ~ 1 (but always less than 1) — v2 follows v1, which 
is why this topology is also called a source follower or emitter follower for FETs and 
bipolar transistors, respectively. 

The input admittance is calculated to be 


aI. Yi +Yoo)¥ 10 
Yot+y2 7 Yet Fag Por es 


Yi = yu — 


= ee (5.126) 


at ¥ yp + —l0_, (5.127) 


because typically, at least at frequencies sufficiently below fr, g ne Y 49, and 
assuming that Y; >> Y¥ 49. 

Compare this to the input admittance of the topology where node 0 was grounded, 
Equation (5.109), and you will notice that the influence of Y jg is substantially reduced, 
while Y ;5 does not suffer from the augmentation due to the Miller effect. We can 
therefore state that the topology with node 2 grounded presents a much lower input 
admittance. 

Going to our usual FET example where the amplifying device is represented by 
the equivalent circuit in Figure 5.20, we can show that Equation (5.126) may have an 
unexpected result. In this case, 

JoCcs 
1+ 8m + J@CGs © 
YL + gps 
Now assume that gm >> wCGs, i.e. @ & wr, that YL is capacitive (YL, = jwC_), and 
that oCL > gps. 


Y, = joCep + 


wCgsC. 
gm + y@(CL + Cogs)’ 


or separating into real and imaginary parts, 


wo gmCcsC © (qe + Cgs) 


Y, = j@Cop 


Y= 


+C : 5.128 
gn + or(Cy + Cos)? ga, + @*(Cy + Cos)? o] ae 


We created an admittance with a negative real part! This can be useful, for example if 


we want to build an oscillator (see Section 5.5), but also very dangerous for amplifier 
stability. 


More free ebooks : http://fast-file.blogspot.com 


5.4.3 


Fig. 5.24 


Building blocks for high-speed analogue circuits 323 


Moving back to the more general case, let us calculate the current gain: 


¥i Wit oo) 


Aj = : 
Filo t Lat itt+s,) + El! s + Lap) 


(5.129) 


Comparing this with the equation for the current gain with node 0 grounded (Equation 
(5.112)), we see that the denominators are equal. If further g | >> Y 19, Y 12, which is 
typically the case, the two current gains have equal magnitude. 

Finally, the output admittance of the topology with node 2 grounded will be 
calculated. The input port is terminated with a generator admittance Yo. 


Yio + 8 Wo + ¥12) 


Y2=Yo + 
as YetYitYu 


(5.130) 


To interpret this equation, assume that ¥g >> Yo + ¥ 12, 8, > Yio. Then, 
Yao te.™ a3 


in most cases. Compared to the output admittance of the original topology which had 
node 0 as the common node (Equation (5.111)), we see that now we have a substantially 
higher output conductance. 

The combination of a very low input conductance (very high input impedance) 
and high output conductance (low output resistance) is the most important aspect of 
common-drain/common-collector topologies. 


Feedback 


Negative feedback is another important principle in amplifier design. In small-signal 
design, it is used for impedance matching purposes, to make an amplifier stable and to 
increase its bandwidth. The negative feedback amplifier was invented by Harold Black 
in 1927 [22]. 

The most important feedback implementations in high-speed amplifier design are 
shunt-shunt and series—series feedback, as shown in Figure 5.24. 


[YJ] [Y] 


[Y] [Y;] 


(a) (b) 


Feedback configurations: (a) shunt-shunt feedback and (b) series—series feedback. 
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Shunt—shunt feedback 
Case (a) is easily calculated using a y matrix representation, because the resulting y 
matrix is the sum of the individual matrices. 


= =f 
[Yr] = (Y1+[%l = > Yu,f = Y12 a (5.131) 
yoait y21,f 22 + 22, f- 
Series—series feedback 
Series—series feedback is better treated using a z matrix representation: 
Vp = Z11l1 + Z1212 (5.132) 
v2 = Z2101 + 222/12. (5.133) 


Conversion from y to z matrix is easy because the z matrix is simply the inverse of the 
y matrix: 


1 | yao | 
Z| = —— 134 
os A(Y) oe, yu d’ ae 


where A(Y) is the determinant of the y matrix and A(Y) = yj, 22 — y12y21.- 
Once the z matrices have been obtained, the resulting z matrix of the circuit with 
series—series feedback is the sum of the individual z matrices: 


Z1+Z Z12 +Z 
Ae VARS e7e 11 lif 212 ]) (5.135) 
221+ 221,f 222 + 222,f 
The conversion back from z to y matrix is equally simple: 
1 = 
iyj=—— = | (5.136) 
AZ) |=eer tin 


Use of feedback in small-signal amplifiers 
A very common feedback example is shown in Figure 5.25, where an admittance Y¢ is 
connected between the output and the input of a common-source amplifier. The FET 


YN 


, 1 


Example of shunt-shunt feedback in a common-source amplifier. 
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shall be treated using the generic equivalent circuit in Figure 5.17. The feedback two- 
port contains only one element, Ys. If [YQi] is the y matrix of transistor Q1, then the y 
matrix of the transistor with feedback is 


Y; — Y, 
ivi= bas + Yt y12,Q1 | (5.137) 
y21,q1 — Ye y22,q1 + Ye 


Unilateralisation 
An immediate application of this feedback technique is the elimination of the parameter 
y12. Choosing 


Yr = —Y12,Q1 


results in a unilateralised amplifier two-port where the input parameters no longer 
depend on the output load, and vice versa. This can be used to improve amplifier 
stability, and is referred to as neutralisation. 

There are several ways of achieving this. In narrow-band amplifiers, the usually 
purely capacitive feedback may be tuned out using an inductor. The inductor is chosen 
to form a parallel resonance with the feedback capacitor at the frequency of operation. 

A more elegant technique was invented by Harold A. Wheeler in the early 1920s 
for electron tubes. A current with equal magnitude — but opposite phase, as the current 
through the feedback admittance — is fed back from the output to the input node, where 
the two currents cancel out exactly. This is shown in Figure 5.26. In integrated circuits, 
the realisation of the autotransformer is hampered by the typically high losses of on-chip 
inductors. However, any kind of phase reversal will do; a particularly simple technique 
will be discussed further down in the context of the differential amplifier (p. 336). 


Port matching 
A very common task in amplifier design is matching, e.g. the input admittance Y; to 
the generator admittance: Y, = YG, where Y¢ is the complex conjugate of the generator 


Vop 


Amplifier neutralisation using an autotransformer. 
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—> O——— 0, 


Inductive emitter degeneration as an example for series—series feedback. 


admittance. While this is commonly done by cascading a matching network and the 
amplifier two-port, the same result can frequently be achieved by using feedback. 
With shunt-shunt feedback, the input admittance becomes 


(12,91 — Ys) (va1,e1 — Ys) 


Y; = yii,qi + Yt (5.138) 
” Y~ + yo2,Q1 + Ye 
Setting Y; = Ya and solving for Yr, we obtain 
Y= (yit.q1 — YG)CL + y22,91) — Y21,Q1912,Q1 (5.139) 


7 Y6 — Yu — (yt,a1 + 12,01 + y21,91 + 922,01) 


Inductive source degeneration 

Series—series feedback is also commonly used in matching problems. A practical exam- 
ple is shown in Figure 5.27. An inductor is inserted into the source lead of a FET. This 
is referred to as inductive source degeneration, and may equally be applied to bipolar 
transistors. We shall now investigate its effect on the input impedance. For simplicity’s 
sake, we assume that for transistor Q;, Y ;, and Y 59 can be neglected. For a general 
impedance Zr in the source lead, the input impedance becomes 


v 1 g 

z= = 42 (14+ ee). (5.140) 
it Yio 

In the specific case, Zp = ywL. Using the simple FET equivalent circuit in Figure 5.20, 

further Y jy = ywCgs. Recall that ap = gm/Cgs, and we find 


Zi, =o7rl+y (ot - ) : (5.141) 


wCgcs 


The inductance hence creates a real part in the input impedance. 


Bandwidth improvement 
Both shunt—shunt and series—series feedback can be used to increase bandwidth, 
however at the expense of maximum gain. 
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Two-port in y matrix representation with shunt-shunt feedback. 


Let us investigate the use of shunt-shunt feedback. First, realise that the influence 
of the generator impedance needs to be included. Figure 5.28 shows the correspond- 
ing schematic. We are interested in the voltage gain between generator and load. For 
Y-¢ = O, this is 


v2 Av 
Gv=—= (5.142) 
vo = 1+ Ze(yi1 + Avyiz2) 
where Ay is given by Equation (5.100). With Yr 4 0, we obtain 
Y, 
fact 
Ay =—A y21 5.143 
Vv v Yr ( ) 
yo2 + Yi 
A 
v= v (5.144) 


1+ Ze[yu + Avy + Ye(1 — AY] 


The noteworthy feature here is that Y; appears magnified by | — A‘, in the denominator. 
We found this already in a different context when discussing the Miller effect. 

The bandwidth enhancement effect can be seen in a simple example. The amplifying 
device shall be the simple FET from Figure 5.20. Then the y matrix is, with some 
appropriate simplifications, 


yu = so(Cos + Cop) © y@Ces 
yi2 = —J@Copd 
y21 = &m — J@CGD © 8m 


y22 = gps + Jw@Cop © gps. 
Let us further assume that 
Yr < gm. 


Then, 

| ee 
YL + gps 
Y 

ne L + 8ps 
Yu + gps + Yr 
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and 
7 Ay . 
1+ Zg[( — Ay) ¥i + yo(Ces — AYCen)| 


Gy 


We calculate the 3dB cutoff frequency as the frequency where the real and the 
imaginary parts in the denominator are equal. For Ys = 0, this is 


1 
Za(Cos — AvCcp) 


Next, we apply a purely resistive feedback, Ys = Gr: 


oc(¥ = 0) = 


1+ ZeGe(1 — A) 
Zg(Ces — AYCop) 


ac(Y¢ = Gr) = 


Frequently, Gr < (YL + Gps) and therefore Ay ~ Ay. The resistive feedback hence 
results in a very substantial bandwidth enhancement by the factor | + ZGGr(1 — Ay). 
The low-frequency gain, however, decreases to 


Ay 


Gy(o > 0) = ; 
vi )= Ty ZR — Av) 


so that the product Gy(@ — 0)wc = const. 

Larger bandwidth enhancement is possible if we allow Y¢ to have a negative 
imaginary part. This will be treated further down. 

Bandwidth enhancement using series—series feedback will be treated for the specific 
example shown in Figure 5.29. This is a transadmittance stage which converts an input 
voltage into an output current. The series feedback using Z¢ will first of all increase the 
input impedance to lower the load on the preceding stage. If it is made complex, it can 
be used for significant bandwidth enhancement as well. 


-— 
-= 
= i : 
Vo 
i 
> O- Q, 


Bandwidth enhancement using series—series feedback. 
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The transadmittance of this stage is 


a om (5.145) 


0) @ 
: 1+ smzi(1+22) 


if the FET is described by the simple equivalent circuit in Figure 5.20, and using wp = 


&m/Cos.- 
If now Zs is a parallel RC network, 


Re 
Zt S\, = 
1+ soR Cs 
and Cy is chosen, , 

Cr = ae 
op Re 
the frequency dependence in Y7 will disappear: 

_ __&m 
1+ gmRe , 


at least for this simple equivalent circuit! 


Yr 


Amplifier configurations with two transistors 


In the first part of this chapter, we have seen how the fundamental topologies we can 
realise with a three-terminal amplifying device have very different properties in terms 
of input and output admittances, as well as voltage and current gains. Further flexibility 
in tailoring amplifier properties is achieved when we combine two of the fundamental 
topologies. We will use generic FETs in order to help visualise the circuits. However, the 
fundamental concepts apply equally to bipolar transistors — in fact, to any three-terminal 
amplifying device, as was outlined in the more abstract discussion above. 


Common-drain/common-source configuration 

Suppose that we want to construct a buffer amplifier, which shall impose a minimal load 
on a generator, yet also have a significant voltage gain. This can be achieved with the 
combination of 


e a common-drain stage (node 2 as the common node), providing the high input 
impedance, and 
e acommon-source stage (node 0 as the common node), providing the voltage gain. 


Figure 5.30 shows the schematic of the common-drain/common-source (CD/CS) topo- 
logy. Q, is the common-drain transistor, Q2 the common-source transistor, Yz, is the 
load admittance and Yc the generator admittance. 

The two important aspects of this configuration are input admittance and voltage gain, 
as discussed. 

The voltage gain of the second stage is 


v2 Sm §m,Q2 
VA Yi +Y¥o gps.q2 + Yi’ 


2 


Av,q2 = 
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Schematic of a CD/CS amplifier cell. Bias arrangement has been omitted for clarity’s sake. 


using Equation (5.107) with the simplifying assumptions indicated there, and the FET 
equivalent circuit in Figure 5.20. 
We can now calculate the input admittance of stage two: 


Y1.@ =Yiog@tYi.q( — Av.q2) = so[Cas,e2 + Cap,q2(1 — Av,q)]. 
Because Q> is in common-source configuration, we observe the Miller effect, which 
may significantly increase the capacitance seen from node A into Q2. To judge the 
importance of this, consider the output admittance of transistor Q;, which appears 
in parallel to Yj,qg2 at node A. We can use Equation (5.130) with the appropriate 
simplifications: 

Ya,.a1 © Y20,.Q1 + 8n.q1 = 8DS.Q1 + 8m,Q2 © &m,Ql- 
The admittance from node A to ground can then be written as 
Ya = 8m,qi (1+ jota), 


where Ta is the characteristic time constant of node A: 


_ CGs,q2 4 Cop,Q2 
§m,Ql §m,Q1 


TA (1 — Ay,q2). 
As long as (27t,)~! is significantly outside of the intended frequency range of 
operation, its effect can be neglected. 

The concept of the characteristic time constant of internal nodes is very helpful in 
high-speed design, especially when tracking down reasons for unexpected limitations 
in performance. 

Here, the situation may not be so bad, because the high capacitance seen into Q2 is 
compensated for by the high conductance seen into the output of Q, in its common- 
drain configuration. 

The voltage gain of stage Q is Equation (5.125): 


= 
a a Yi + Yoo.a1 ~ __&m,Ql _ 1 
2 Vv] Yioait 8,01 gsmaitYiqr 1+ jets 
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The total voltage gain is finally 


Av,q2 


——— ; 
1+ jota es 


Ay = Ay,qiAv.q2 © 


ifo< ie Again we notice the importance of the characteristic impedance of node A. 
The input admittance can be calculated from Equation (5.126): 


Y 10,Q1 joCgs,ol 

Yy,xY ak = jw eae 
<S a §m,Ql i a §m,Q1 
Y1.02 Y1,Q2 


Recall that in our case, Y| Q2 is purely capacitive. As shown earlier, a capacitive load to 
a common-drain stage leads to a negative real part in the input admittance (see Equa- 
tion (5.128)). Whether this represents a problem for amplifier stability depends on the 
generator admittance value Yg. This should be kept in mind when investigating stability 
problems. 

Finally, we take a look at the overall current gain. Because the output current of 
the first stage feeds the input of the second, we expect the overall current gain to be the 
product of the two individual current gains. However, we have to observe that we always 
counted currents positive when they flow into the device (see Figure 5.18). Then, 


Ay = —Aj,Q1 ° Ar.qa- (5.146) 


The current gain of the first (common-drain) stage is given by Equation (5.129), 
observing that now the load admittance is the input admittance of the second stage: 


Ol? 

)+ Y10,q1 (11,92 + ¥20,Q1) : 
(5.147) 

while the current gain of the common-source stage is given by Equation (5.112): 


Y1,Q2(¥ 10,91 + 8, 


Arai = 
Y 12.91% 10,01 + Y 20,01 + Y1,Q2 + 8m.Q1 


_ (8.02 =F 12,Q2) Yu 
Yin“ t+ 8 99 + X10,Q2 + X20,Q2) + Y10,.Q2%1 + ¥20,Q2) 


Aq,Q2 , (5.148) 


where Y__ is the load admittance connected to the drain of Qo. 

Since the explicit calculation of Ay presents considerable difficulty, let us make a 
number of simplifying assumptions. First, we make the two-ports unilateral, i.e. we 
assume Y ;, = 0. Then, we assume that the load admittances are always much larger 
than the elements Y 59 for both transistors: Yj,.Q2 > Yo9.Q1, YL > Y29,qQ2- Equation 
(5.146) then has a very simple solution: 


g g 
a= 22 [pe Se (5.149) 
Y 10,22 Y 10,01 


For further interpretation, turn again to our simple FET equivalent circuit (Figure 5.20), 
and recall the transit (cutoff) frequency wr ~ gm/Cos. Equation (5.149) can then be 
written as 
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Darlington amplifier configuration. 


w (a) wo 
Ap = — (FERRO 4 EO). (5.150) 
a) a) 
Assume now that 7.91 = @7T,Q2 = or. Form < aor, the current gain is now 


approximately the product of the current gains of the individual devices, but rolls 
off at —40 dB/decade, instead of —20dB/decade. The frequency where |Aj| = | is 


VJ 2/(V5 — Der = 1.272 a7. 


Darlington amplifier 

The CD/CS configuration is not quite the same as the popular Darlington [7] topology, 
shown in Figure 5.31. The difference is that in the Darlington amplifier, the drain of 
Q) is connected to the drain of Q2. While the goal is similar, there are two noteworthy 
differences: 


e The feedback admittance of device Q1, Y 17 9), is now in the path between the 
output and the input nodes, and not connected directly to ground as in the CD/CS 
configuration. Therefore, the Miller effect will be present at the input. 

e The output current of Q; now also flows through the load. This changes the current 
gain equation. Using the same strong simplifications as in deriving Equation (5.150), 
we now find 


Ay = — [PROUT yy (FEO 4 MEO)). (5.151) 
a) o a) 

Compared to the CD/CS amplifier, the Darlington has slightly more short-circuit 
current gain close to wr. If again both transistors are equal and equally biased, the 
frequency where |A | = 1 is 2wy. This is why this configuration is sometimes also 
called fr doubler. The expression should be taken with a grain of salt. Remem- 
ber that fr is derived here from current gain, and that we neglected the feedback 
admittances in calculating Equation (5.151). The Miller effect disadvantage of the 
Darlington stage therefore does not show up in the simplification, but can signif- 
icantly affect circuit performance for small values of YL. Further, the current gain 
rolls off with —40 dB/decade, which may lead to stability problems when feedback is 
applied around the stage. So your mileage may vary. 
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Battjes fy doubler circuit. 


Battjes fr Doubler 

The well-known circuit shown in Figure 5.32, patented by C. R. Battjes [2], is essen- 
tially a Darlington amplifier (Q;, Q2) combined with a current mirror (Q2, Q3), which 
makes sure that both transistors in the signal path are operated with the same current. If 
they are also of equal size, they will have the same transit frequency. The circuit shown 
uses bipolar transistors (as in the patent), but the concept equally works with FETs. 
Note that the input capacitance of Q3 needs to be accounted for — if Q;/Q3 and Q2 are 
supposed to have the same current, then Q2 and Q3 need to have the same size, and the 
effective capacitance attached to node A approximately doubles (neglecting the Miller 
capacitance seen into Q2). 


Cascode amplifier 
The cascode amplifier is a combination of a common-source (or common-emitter) with 
a common-gate (or common-base) topology. It was conceived as a way to overcome the 
Miller effect and first described in 1939 using two triode tubes [21], where the cathode 
of tube 2 was series-connected (‘cascaded’) to the anode of tube 1. The term cascode 
hence refers to cascaded anode. Figure 5.33 shows a cascode realised using FETs. 

Let us assess the input admittance first. Because Q; is in common-source configura- 
tion (node 0 grounded), we use Equation (5.109): 


Yi = Yi0.91 + X12,01 (1 — Av,qu)- 


When calculating Ay,gi, we recognise that the load admittance is the input admittance 
of Q2 at node A: 


8m,Q1 ~ 12,Q1 


Ay, 1= o 
= Yie@+Y¥20e+ 212,01 


The input admittance Yj,Q2 is calculated using Equation (5.117), because Q2 is in 
common-gate configuration: 


YL + Yin 92 


Y1,Q2 = Yjo.q2 + (8.02 + ¥9,Q2) y 


t+ Y20,92 + Y12,92. 
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Schematic of a cascode stage built with FETs (bias arrangement omitted). 


Assume that g Qo > Yo0,q2> further YL >> Yio.Q9, ¥29,Q2- Then, the input 


admittance of Q2 simplifies to 
Y1,q2 © Yio.ge + 8 m,02" 
With this simplification, the voltage gain of Q, is then 


8m.Q1 ~ Yy01 
Y 10,92 + 8m.o2 + £20.01 + Y12,01 


Ay,ql = 


§m,Q1 


2 


Yi0,.Q2 + Late 


further assuming that 8m.Q2 > (Y 12.91 + Y20,Q1)- 
The input admittance of the cascode finally becomes 


§m,Q1 1 
YM *Yioqgi + Xia | 1+ 7 (5.152) 
8m,Q2 1 + —10.Q2 
§m,Q2 


Using our simple FET equivalent circuit (Figure 5.20) this finally becomes 


a ol 
Y, = jw | Cos,aqi + Cep,ai | 1 + em —— ]|- 
gum 1 iS 
The suppression of the Miller effect is simply explained by the low voltage gain of the 
common-source stage —2m,Q1/&m,Q2 for low frequencies. Frequently, transistors Q1 


and Q» are chosen the same size, and since they share the same drain (or collector) 
current, it follows that gm,.qi = 8m,q2 and Ay,qi = —1. 
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The calculation of the output admittance starts with the output admittance of tran- 
sistor Q2, which is in common-gate configuration (node | grounded), using Equation 
(5.118): 


Y¥2.Q1 + Y 10,92 


Y2.a1 + Y 10,92 + Y20,92 + 8n.Q2 


Y2 = ¥2,Q = Yi2.0+ 220, 


where YQ; is the output admittance of transistor Q; in common-source configuration 
(see Equation (5.1 11)): 


8m.qi ~ %12,Q1 


Y2.Q1 = Y20,.91 + Yi2,91 {+ 
Q Q Ye +Yioai t+ Yi2.a1 


where YG is the admittance terminating the input port. We simplify the expressions by 
assuming that the feedback admittances are small and the corresponding terms can be 
neglected. Then Y2,Qi © Y 59 Q; and the overall output conductance becomes 


=] 
§m,Q2 + Y0,Q2 


(5.153) 
Y 10,q2 + Y20,01 


Yo © Yn9.q2 (: a3 
Because g . /Yo is the magnitude of the maximum voltage gain in common- 
source configuration (see Equation (5.108))), an additional sensible assumption is that 


§m,Q2 > Y¥40,Q2: Then, 


-1 

g 

&m,Q2 

¥2 © Yoo op ( 1+ ———"—__ (5.154) 
iQ Y 10,q2 + X20,01 


In our simple FET example, we finally find 


-1 
&m,Q2 
Ya ~ gos.ca(1+ ) : 
q gps.qi + J@CGs,Q2 


and for the quasi-static case, w — 0: 
&DS,Q2 
eo §m,Q2 
8DS,QI 


Yn 


The output admittance is therefore significantly reduced compared to the common-gate 
or common-source configurations. 
The voltage gain of the cascode stage, Ay = Ay,q1 - Av,qz, IS 


~ ——Sna1__ Eno * X20.02 (5.155) 


YL+¥o0.Q2 Smoot Y10.Q2 
neglecting the feedback admittances. In the FET example, 


&m,Q1 _ 1+ gps,Q2/8m.@ 
YL + gps,qQ2 1+ j@/or 
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or provided that gm,q2 > gps,q2 and w K wy: 


a &m,Q1 

~  ¥et+ 8DS,Q2 
In summary, the cascode configuration provides a comparable voltage gain to the 
common-source topology, but its input admittance is significantly lower due to the 
reduction of the Miller capacitance and its output admittance is significantly higher. 

Finally, an important side effect of the cascode shall be pointed out here: the real part 

of the output admittance may become negative. In practical devices, the assumption 
that parameter Y 5, is purely real is not correct; a better approximation is Y 59 = gps + 
joCps. If we insert this into Equation (5.153) and separate real and imaginary parts, 
we find that the real part becomes negative if 


8DS.Q1 * 8DS,Q2 < © Cps,q2 - (Cps.qi + Cas,Q2): 


assuming that gm.q2 >> gps,qi along the way. Frequently, this can lead to amplifier 
instabilities, but it may also be used to compensate losses in travelling-wave amplifiers, 
as will be discussed later. 


Differential amplifiers 


An important component in many high-speed electronic circuits is the differential 
amplifier. One of the most influential pioneers of biomedical engineering, Otto Schmitt, 
is frequently held to be the father of the differential amplifier topology [34] — the ability 
of the differential amplifier to reject common-mode signals at its input is crucial for 
the measurement of weak bio-electric signals. Incidentally, he also invented the Schmitt 
trigger circuit. 

A generic differential amplifier topology realised with FETs is shown in Figure 5.34. 
A first noteworthy difference between the amplifiers discussed so far is that the input 
and output voltages are not referenced to ground, but to the other input and output 
electrodes, respectively. 

Figure 5.35 shows the small-signal representation of the differential amplifier, where 
the transistors are represented using the generic small-signal equivalent circuit from 
Figure 5.17. The transistors are identical. 


Differential mode 

Any combination of nodal input voltages va = vy + UX, UB = Vp + vg can be split into 
a differential mode (vi, = —vg, vk = vg = 0) and acommon mode (vk = ug, vA = 
Up = ()). First, we concentrate on the differential mode. To calculate the voltage of the 


common mode vp, we first loop through va, v1,1, v1,2 and ug: 


U1,1 — 1,2 
—vU, +11 —vu2-v, =0> va = a ee 


On the other hand, vp = vs — v1.1 and therefore 


Vii + U1,2 
5 : 


vo = 
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Fig. 5.34 Generic differential amplifier topology. 
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Fig. 5.35 Small-signal representation of the differential amplifier. 
For symmetry reasons, vy = —vp also implies vj, = —vj,2 and hence 


vo = 0 


in differential mode! The common node A constitutes a virtual ground. This is a very 
important concept in high-speed circuit design, as it dramatically reduces problems with 
common-node impedances, such as in bond wires to ground, which otherwise may lead 


to a variety of feedback problems. 
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Now that A is grounded, the two halves of the differential amplifier reduce to standard 
common-source (or common-emitter) circuits which we have already analysed. The 
input voltage to the left half is vj.) = v1 /2, while the right half receives v1.2 = —v,/2. 

Using Equation (5.109) to calculate the common-source input admittance Y1 cs for 
the individual transistors, the differential mode input admittance is 

i _ Yics 


fia = ; (5.156) 
Cal 2 


Likewise, the output admittance is half the output admittance Y2,.cs for the common- 
source stage given by Equation (5.111): 


eet ae, (5.157) 


Equation (5.107) is used to calculate the common-source voltage gain Ay cs. The 
differential voltage gain is then 


v 
Avd= a = Av.cs. (5.158) 


Common mode 

In common mode, both input terminals have the same potential to ground: vk = vg. 
The individual transistors are connected in parallel then at input and output, resulting in 
the equivalent circuit shown in Figure 5.36, and their y matrices can simply be added. 
We arrive at an equivalent transistor Q. with the following y matrix: 


fyoe) = 2 Yioi + XYi21 —Yi21 
YQel = . 
Boies Loo Ei 


2m,1 


The parameters are those of the individual transistor. This problem can be treated using 
the results of the feedback discussion (see p. 323), converting the y matrix first to a z 
matrix, and adding the z matrix corresponding to Yo, which is 


2Y, 


v1 =Va=VpB Yo 2) 


Equivalent circuit of the differential amplifier under common mode excitation. 
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1 1 
_|Y¥o Yo 
Yo ¥% 


and converting back to a y matrix. 

Here, we will consider a quick solution using a simplified equivalent circuit where 
we neglect both Y ,,, and Y 59, for the transistor Q-. Let Ene 2gm,1 and Y ig. = 
2Y 19,1, a8 discussed. The voltage gain Ay cm for common-mode excitation is then 


es ——, (5.159) 


assuming 2m,1/Yo >> 1. The output voltage here is taken between one of the output 
terminals and ground — the differential output voltage for common-mode excitation is 
0, provided that the circuit is perfectly symmetrical. The voltage gain under differential 
excitation, but with the output voltage taken between one of the output terminals and 
ground, is 


&m,1 
A == 0 
V,d,gnd 2Y,. 
The ratio 
ee _ | &m1 (5.160) 
Av,em Yo 


is the common-mode rejection ratio, a measure for the suppression of common-mode 
input signals. We see that Yo should be as small as possible. 


Neutralisation of differential amplifiers 

The fact that the output voltages of a differential amplifier are exactly 180° out of phase 
can be used to elegantly eliminate the effect of the feedback capacitances in the transis- 
tors. This is shown in Figure 5.37. The capacitances Cy, which have to be exactly equal 
to Cgp, feed a current into the gate nodes of the two transistors, which is equal in mag- 
nitude, but of opposite sign, compared to the currents flowing through Cgp, cancelling 
these capacitances. 


A more complex differential amplifier example 

Differential amplifiers for high-speed applications are frequently more complex and 
exploit the special properties discussed in the section on basic amplifier topologies using 
two transistors. Figure 5.38 shows a cell common to many high-speed differential ampli- 
fiers. Transistors Q; and Q2 are in common-collector configuration, connected to the 
transistor pairs Q3, Q5 and Q4, Q6, respectively, which form a differential cascode. 
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gle 6, 


(<4 


Differential amplifier with neutralisation. 


A more complex differential amplifier example. The dashed line indicates the symmetry plane; 
all nodes along this plane are virtual grounds. 


All nodes along the median, which is indicated as a dashed line, are virtual grounds, 
provided that the circuit is driven fully differentially. This is particularly interesting 
for the bases of Qs and Q¢, because proper grounding of the base terminal can be a 
problem in cascode stages — here, it is easy due to the virtual ground property. Equally, 
the emitters of Q3 and Q4 are properly grounded. The DC bias voltage terminal, Vcc, 
is also an RF ground, facilitating RF/DC decoupling. 
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Source-coupled amplifier schematic (bias elements omitted). 


These advantages lead to an increasing use of differential topologies in micro- and 
millimetre-wave circuits. Drawbacks are the increased power consumption due to the 
doubled component count and the increased area consumption. Another problem may 
be on-wafer testing, due to the necessity for differential probes. 


Source-coupled amplifier 


The amplifier topology shown in Figure 5.39 has, at first glance, a configuration very 
similar to the differential amplifier. We immediately recognise the source-coupled pair 
and the common-current source. However, the amplifier is driven single-endedly and 
also has only a single output. Upon closer investigation, Q; is in common-drain and Q2 
in common-gate topology. 

The idea is therefore very similar to the CD/CS amplifier discussed earlier. The 
common-drain input transistor creates a low input admittance, while the common-gate 
stage delivers the voltage gain. The Miller effect is eliminated, and the input is well 
isolated from the output. 

Compared to the CD/CS amplifier, the input admittance is higher, because the input 
admittance of the common-gate transistor Q2 is much higher than that of a comparably 
biased common-source transistor: Yi,.q2 * Y jo.q2 + g,,3 See Equation (5.117) with 
Y_ > Y20,qQ2- The input admittance of the source-coupled amplifier is then 


Yio,a1 


Mi = Zio ot (5.161) 


ics 8m.qo + Y 10,Q2 
8mai + Y 10,01 


Using our simple FET equivalent circuit, Y }g = yw@Cocs and 


w 
tnt Lio=sm(14+s). 
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If the transistors therefore have the same wry, the input admittance is 


Cos,Q1 
ie &m,Q2 


&m,Q1 


Y| = ja | Cop,ai + 


It is purely capacitive and does not show the risk of a negative real part, which the 
CD/CS amplifier had posed. 

The circuit can also be compared to the cascode — the source-coupled amplifier has 
a lower input admittance, is non-inverting and requires a lower supply voltage than the 
cascode, but the cascode requires less current, because the current through the common- 
gate stage is recycled in the common-source transistor. 


Tuned amplifiers 


Tuned amplifiers are commonly used at micro- and millimetre-wave frequencies when 
the fractional bandwidth is small. The fractional bandwidth is the required opera- 
tional bandwidth divided by the centre frequency. For example, the 24 GHz license-free 
ISM band has a total allowed spectral width of 250 MHz, so any amplifier will need 
sensibly only a fractional bandwidth of 10~7. Other applications, such as emerging 
ultra-wideband sensor and communications standards, will have fractional bandwidths 
which are orders of magnitude larger — the design of amplifiers for such systems will be 
treated in the next section (p. 350). 
A typical tuned amplifier will use three fundamental circuit techniques: 


(i) A resonant load — the load admittance goes through a minimum at the frequency 
of operation, maximising the voltage gain for a given transconductance. 
(ii) Complex conjugate match at the input, ensuring that the available power from the 
source is delivered to the amplifier. 
(iii) Complex conjugate match at the output, ensuring that the available power from the 
amplifier is delivered to the load. 


For LNAs and power amplifiers, other matching strategies may apply for the input and 
output ports, respectively. These will be treated in the sections on LNA design (p. 365) 
and power amplifier design (p. 376). For now, we assume that achieving the maximum 
gain is our objective. 


Resonant loads 
Let us first investigate the resonant load, using the simple example of Figure 5.40. The 
schematic also indicates the generator and the equivalent input admittance of the fol- 
lowing stage — it is essential to include at least the next-stage input admittance in the 
calculations, and due to feedback, the generator admittance will also have an effect, 
albeit more weakly. 

The admittances ¥2.Q; and Yj,Q2 can typically be represented by a conductance in 
parallel with a capacitive reactance (exception — if the following stage is a common-gate 
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Y1,02 
Generator Load 


Tuned amplifier stage with resonant load. 


or common-base stage, the reactance may be inductive). These elements are absorbed 
into the load. The resulting reactances of the tank circuit are then 


Gr= RO + Re(¥2,91 + Y1,qQ2) 


1 
Cr=CLt+ pnt ¥2,Q1 + Y1,Q2) 
Ly = Lyi. 


The transfer function of the voltage gain is 


& 1 
Ay(@) = Ze : 7 (5.162) 
Ti+) (of - star) 
which has its maximum at 
1 
wo = ; 
: Ly Cr 
and its —3 dB corner points at 
Gi 1 G 
W1 = f + = - : 
2 4c, LrCr 2Cr 
The bandwidth between the —3 dB points is therefore 
G 
Mai = Ga (5.163) 
CT 


This equation can be used to choose the proper Gry for the required bandwidth of the 
amplifier. 
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Using Equations (5.162) and (5.163), we find the product of the voltage gain at w = 

q@o and the —3 dB bandwidth: 
iting hee, (5.164) 
Cr 

which is interestingly independent of frequency. This is due to simplifying assumptions, 
of course. In the ansatz for Equation (5.162) we used Equation (5.107) with the assump- 
tion that g ee Y 1», hence that feedback is negligible, which is no longer true at very 
high frequencies. 

A tuned tank circuit always bears the risk of amplifier instability. For the common- 
source amplifier in the example, we use Equation (5.109) to calculate the input 
admittance of the circuit, using the expression in Equation (5.162) for Ay(@): 


&§ 1 
Y=YiotYu Ieee rs ; 
Tity [ost - ree 


T 


Cc 1 
os 1 J [os are | 
=VYiptYotVy 


Gr C on ca 
I [os = ae | 


If, as is usually the case, Y }y * ywCgp, the third term in the sum has a negative real 
part for @ < wo. The risk of parasitic oscillations increases with increasing peak gain. 
Neutralisation measures as discussed already (p. 325) may become necessary in such 
cases. 


Input and output matching networks 
A common requirement in microwave amplifiers is that input and output admittances 
need to have a predefined value. There are two major reasons for this: 


(i) Ifthe input and output admittances are the complex conjugates of the source and 
load admittances, the source’s available power is transferred to the amplifier, and 
the amplifier’s available power is transferred to the load, resulting in the maximum 
power gain — this value is called the maximum available gain and will be discussed 
shortly. 

(ii) To avoid standing waves on interconnecting transmission lines, the lines need to 
be terminated by their characteristic impedances at least at one end. 


The characteristic impedance of a lossless transmission line is real; hence, input and out- 
put admittances are normally tuned to a purely real value where the circuit will interface 
with a transmission line. For internal nodes, however, this is not necessary — in fact, 
as we will see in our discussion of broadband amplifier techniques, at internal nodes 
impedance matching is often abandoned altogether, in favour of increased bandwidth, 
but with penalties in power gain. 

Because the Smith chart (see p. 295) is the most important tool in solving matching 
problems, we will conduct the matching discussions using scattering parameters. 
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Most importantly, we need to translate source and load admittances as well as two- 
port input and output admittances into reflection coefficients. This is easily done: 

Yo-Y Z-Zp 
~ Yot¥ Z+Zo’ 
where Yo = 1/Zo is the normalising admittance, which is frequently 20 mS (corre- 
spondingly Zp = 50 Q2), but can be chosen arbitrarily. 

We have seen that in two-ports which are not unilateral (yj2 4 0, correspond- 
ingly Sjz 4 0), the input admittance depends on the load admittance, and the output 
admittance depends on the source admittance. In general terms and with the two-port 


expressed as a scattering matrix, the input (I",) and output (12) reflection coefficients 
are (p. 300) 


r 


(5.165) 


Sy2So1T 
r,=S —____—. 5.166 
1 it 1 Sul ( ) 
Sj28211G 
I,=8 ———_., 5.167 
2 22 + 1- Sire ( ) 


where Ig and I’; are the generator and load reflection coefficients, respectively. 
For simultaneous power match at input and output ports, we need these coupled 
equations to be satisfied: 


S12S21T 
ra=S — 
G 11+ 1 = Ser. 
S12S21TG 
rr=S ——__.,, 
L Viele i= Sule 


where I* is the complex conjugate of I’. Solving these equations for the necessary 
generator and load reflection coefficients gm and Lm, we find 
2 
_ ch (Bi BE 
ICi] \ 21Ci| ICi/? 


P'G.m (5.168) 


with 
By = 1—|Sy/? + |Sul? — |A()/? 
Cy = Sy, — A(S) S5p, 


where A(S) is the determinant of the scattering matrix. For the load reflection 
coefficients, we find likewise: 


r Cr | Be P| Bi (5.169) 
Lm = , é 
"(Cal \ 21C2| |C2|? 


By = 1— |Si|* + |S22|* — JAS)? 
Cz = So2 — A(S) Shy. 


with 


Simultaneous input and output power match is not always possible, but requires a two- 
port to be unconditionally stable (see p. 303). 
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(a) (b) (c) (d) 
(e) (f) (g) (h) 


Generic L network topologies. 


Generally speaking, impedance transformation can be achieved using 


e reactances L, C, 
e transformers, 
e transmission line impedance transformation. 


At micro- and millimetre-wave frequencies, ‘true’ transformers based on coils are 
rarely used, because when realised on-chip using planar inductors, they tend to be very 
lossy, and additionally have high parasitic capacitances. So only impedance transforma- 
tions using reactive networks and transmission line impedance transformation will be 
discussed here. 

The most fundamental impedance transforming network is the L network, which can 
have any of the shapes shown in Figure 5.41. 

There is always more than one topology which achieves the desired impedance trans- 
formation. This is an important observation, because other considerations need to be 
taken into account also. For example, the input port may have to be DC-blocked, in 
which case a topology with a series C may be suitable(cases c, d, g, or h in Figure 5.41), 
or DC bias may have to be supplied through the port, in which case a series L and no 
shunt L are needed (cases a or e). Likewise, it may be advantageous to ground the input 
port at low frequencies, favouring a topology with a shunt L (cases b, c, or f). 

Figure 5.42 shows an example of an impedance matching problem, solved using sev- 
eral topologies. In all cases, the impedance in the lower left quadrant is the starting point 
and the centre of the Smith chart is the target. 


Path | uses an L in series with the start impedance and then a shunt L. 

Path 2 also starts with a series L, but a larger one, and then uses a shunt C. 

Path 3 starts with a shunt L, and then continues with a series L. 

Path 4 starts equally with a shunt L, but a smaller one, and then uses a series C to 
reach the required impedance. 


The several options are best visualised using the Smith chart. 
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Table 5.1 Matching a complex load (100 Q parallel to 2.5 pF) to 50 Q 
using different L network topologies, f = 1 GHz 


Path Components Bandwidth 


1 Lg 1=3.1 nH, Lp.2=8.8nH 508 MHz 
2 Lg, {=11 nH, Cp.7=2.75 pF 547 MHz 
3 Lp,1=26nH, Ls 9=7.9nH 561 MHz 
4 Lp |=6.1 nH, Cs 9=3.15 nH 324 MHz 


Example for multiple impedance transformation paths using L networks. 


Other aspects of matching networks need to be considered as well. This shall be 
done in an example, where a parallel RC load (R = 1009, C = 2.5 pF) is matched 
to Zo = 50Q using the different topologies in Figure 5.42. The results are shown in 
Table 5.1. 

First of all, we note that the matching bandwidth, defined as the difference between 
the frequencies where the reflection coefficient becomes |I'| > 0.32 (return loss less 
than 10 dB), is vastly different — path 4 has less than 60% of the bandwidth of the others. 
Also, component values may become impractically larger for on-chip implementation — 
for example, Lp for path 3. 
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(a) (b) 


Example of (a) a z-type matching network and (b) its decomposition into two cascaded L 
networks. 


These calculations have been performed using ideal components. In practice, large- 
value spiral inductors also come with considerable series resistances, which is another 
aspect to consider. 

zt networks are an extension of L networks — they are best thought of as being sepa- 
rated into two L networks, as shown in Figure 5.43. The first L network transforms to 
an intermediate impedance Zintermediate, Which is then transformed by the second L net- 
work to the desired value. z networks offer an additional degree of freedom, so we can 
additionally design for different matching bandwidths. They are additionally attractive, 
because they allow the absorption of interconnect parasitics into the matching network — 
e.g. bond pad parasitics on chip and in the package (or on the PCB board) can form part 
of C; and C2, while the bond wire inductance can be absorbed into L. 

Other combinations of L-type networks exist and can be useful for specific matching 
problems, but this is beyond the scope of this book. 

Figure 5.44 shows three examples of compact tuned amplifiers in an 80GHz fr 
Si/SiGe HBT technology [6]. The amplifiers share the same basic topology — three 
cascaded cascode stages with resonant loads and LC interstage matching using spiral 
inductors. Additionally, inductive emitter degeneration (Equation (5.141)) is used to 
assist the match by increasing the real part of the input impedance. The use of concen- 
trated reactances, even at millimetre-wave frequencies, leads to an extremely compact 
layout. 

Transmission line segments can also be used to transform impedances. Assuming 
lossless transmission lines, the input impedance looking into a transmission line of 
length / and characteristic impedance Zo, terminated by an impedance Zy, is 


ZL + J Zo tan (2 4) 


A + 7Z, tan (204) 


Z;=Z 


(5.170) 


where i’ is the wavelength on the transmission line, 


cO 


f Jeff 


N= 


A very popular example is the quarter-wavelength transformer. In case 1 = i'/4, the 
input impedance becomes 
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Fig. 5.44 


36 GHz amplifier --- 
40 GHz amplifier *« 
50 GHz amplifier 


Transmission/dB 


35 40 
Frequency/GHz 


Tuned millimetre-wave amplifiers in a Si/SiGe HBT technology, using LC loads and matching 
networks. (After [6]) 


ZL 
In other words, to match two impedances Za, Zp, they need to be connected with 
a transmission line which is 4’/4 long and has a characteristic impedance of Zp = 
/Zx Zp. Quarter-wave transmission line sections are also called impedance inverters — 
the reason is obvious from Equation (5.171). 


Transmission lines open up additional possibilities in matching. This is shown in 
Figure 5.45, again using the same start impedance as above: 


Z| (5.171) 


e In path 1, a transmission line section of impedance Zp =50 Q is used first to make 
the impedance real. The intermediate impedance is 14.5 Q; hence the quarter-wave 
section must have an impedance of /50- 14.5=26.9 Q. 

e Path 2 first uses a series inductance to make the impedance real, the intermediate 
impedance is 28.7 2. The quarter-wave section then needs to have a characteristic 
impedance of 37.8 Q. 

e Path 3, finally, uses a shunt inductance to make the impedance real (Zintermediate = 
100 2) and a quarter-wave section with Zp = 70.7 Q. 


Option 2 has the widest matching bandwidth, but the transmission line in option 3 is 
likely the easiest to realise. 


With increasing frequency, tuned amplifiers using transmission line segments become 
increasingly interesting, because spiral inductors are especially difficult to realise and 
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Matching examples using quarter-wave transmission line transformers. 


model, and the main objection against the use of transmission lines — their physical 
size in layout — becomes irrelevant as the wavelength shrinks. Figure 5.46 shows an 
example. The IC represents a three-stage fully differential amplifier for 77 GHz auto- 
motive RADAR systems, realised in a 190 GHz fy Si/SiGe BiCMOS technology [5]. 
The amplifier provides 16dB gain while consuming 90 mW from a 3 V supply. Thin- 
film microstrip’ lines (TFMSLs) are used here for impedance matching purposes. Due 
to the high frequency, the resulting IC is still very compact (740 x 540 m2? chip size). 


Broadband amplifier techniques 


Tuned loads and reactive impedance matching networks are not suitable for amplifiers 
with large fractional bandwidths, such as those used in high-speed fibre-optic systems, 
micro/millimetre-wave instruments, many military systems with high frequency abil- 
ity, or impulse-radio ultra-wideband systems. All of these applications need amplifiers 
where the gain must be flat over a wide frequency range (often the ratio of upper to 


3 In TFMSLs, the ground plane is realised on top of the substrate. This shields the signal line from the lossy 
Si substrate, but leads to very narrow signal lines. 
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Fully differential Si/SiGe HBT amplifier for 77 GHz, using tuned transmission lines. (After [5]) 


lower cutoff frequency exceeds the factor of two — multi-octave bandwidths), and almost 
always the input and output return loss also needs to stay below a specified value over 
the full frequency range. 

In the following section, we will discuss some common techniques which prove use- 
ful in the realisation of amplifiers with very large bandwidths using concentrated circuit 
components. Discussion of distributed amplification, which is also a very important 
concept for wideband amplifiers, will start on p. 354. 


Shunt peaking 

We have already emphasised the importance of the characteristic time constant in the 
discussion of multi-stage amplifier topologies (p. 330). We will see that broadband 
amplifier design always comes down to modifying these internal characteristic time 
constants. 

Consider the simple cascading of common-source amplifiers, shown in Figure 5.47 
together with a strongly simplified equivalent circuit. The load resistance and the input 
capacitance of the following stage are combined into an equivalent impedance to ground 
Zeq- Using Zeq, the transadmittance of the cascaded stage can be expressed as 


17 RL&m,18m,2 
Yr=— == —8m,18m,2Zeq = “St 


__ *L8m.18m,2__— (5.172) 
Vv] 1+ joRLCos,2 
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Intermediate node of two cascaded common-source amplifiers, with small-signal equivalent 
circuit. 


‘. L 
vy C iy 
R, || Cesi1 ie os 
L | == | R, at c2 1 
Y SY 
Vv: Vv, 
C2 Im M1 Za 9m,2 Vgs,2 


Cascade connection of two common-source amplifiers with shunt peaking inductor. 


Obviously, RLCgs.2 is the characteristic time constant of the intermediate node, which 


limits the bandwidth to 


I 
AZ (5.173) 


oO, = = ; 
RiCes,2. 8m2 Ri 


using wr = gm/Ccs. 
We will now partially compensate the capacitive reactance by connecting an inductor 
in series with the load resistor (see Figure 5.48). The transadmittance now becomes 


Yr = —8m,18m,2Zeq 


JoL 
—s 
—_ R L ; 5.174 
§m,1§m,2 Ly _ oLCys.2 + JORLCes,2 ( ) 
Introducing 
L ReCs: 
T= ;m= pCes2 1 
I; L WT 
we rewrite Equation (5.174) [25]: 
Cola 
1+)7{—]m 
on 
Yr = —8m,18m,2Ri 5 : (5.175) 
1-—{ — eo if 
W{ W| 


The new —3 dB cutoff frequency is 


—m2 —yy2 2 
Or = | (SE +msi)s (Sp +m+1) +m?. (5.176) 
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Normalised transadmittance of an amplifier cascade with shunt peaking versus frequency, for 
different values of parameter m. 


Equation (5.176) is maximum for 


m= V2, 


or finally 
1 


20 


Figure 5.49 plots the normalised transadmittance Yy/ (gm,1 &m,2RL) versus the nor- 
malised frequency, and for several values of m. We note that 


(5.177) 


T 


e we can achieve 1.8-fold increase in bandwidth; 

e the increase in bandwidth comes at the expense of gain flatness; 

e however, form = 1 + J/2, the response becomes maximally flat with only a marginal 
decrease in bandwidth. 


Feedback techniques 

We had already seen (Figure 5.29) that a parallel RC combination in series—series feed- 
back can be used to completely eliminate the dominant pole in the frequency response of 
the transadmittance. Let us consider a somewhat more complicated example now where 
the amplifier is loaded by a complex load formed by a resistor and a capacitor in paral- 
lel — the typical equivalent circuit of a following amplification stage. The small-signal 
equivalent circuit is shown in Figure 5.50. The voltage gain is 


Agi osm La (5.178) 
(+ jon) [1+ # + jo(s+ 22)| 
where 
ts = RsCs; tL = RLCL; oT = ent 
Cos 
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Bandwidth enhancement using series—series feedback. 


The enumerator term can now be used to cancel one of the denominator poles: 


e If 1+ swt, dominates, then ts = Tt, is the proper choice. 
e Ifthe second term dominates, then choose ts = wy ! This corresponds to the solution 
already discussed in Equation (5.145). 


Distributed amplification 


The amplifier topologies discussed so far employed concentrated circuit elements and 
are as such not very different from topologies employed at lower frequencies. The dis- 
tributed nature of components, especially interconnect lines, only comes in at the layout 
stage. In the wideband amplifier technique we will discuss now, the transmission line 
nature is consciously used to establish distributed amplification. 

A common problem in achieving high gain at microwave frequencies is that the nec- 
essary large transconductance of the amplifying device requires a large device size 
(source width or emitter area), which in turn invariably increases the input capaci- 
tance. In FETs, in a first-order approximation, the ratio of transconductance to input 
capacitance is the transit frequency: gm/Cos = wr. Ina common-source amplifier, the 
dominant time constant at the input is therefore 

&m Ay 


tT = ZgCgs = Zg— © -ZG VN ; 
@ OWT 


T 


where Zg is the generator admittance Y,, the load admittance and Ay the quasi-static 
voltage gain in common-source configuration. The input time constant is therefore 
directly linked to the voltage gain of the cell, for a given load admittance. 

In narrowband amplifiers, we may be able to compensate for the input capaci- 
tance using a matching network, as we have seen. Very wideband amplifiers, however, 
preclude the use of tuned networks. 

To find a way around the input capacitance limitation, we follow two fundamental 
steps: 


(i) Instead of using one large device, we will use several smaller ones to deliver the 
needed overall transconductance. 

(ii) The input (and output) capacitances will then be absorbed into an artificial 
transmission line. 


The second step is the most crucial one. To understand this concept, remember that 
any transmission line can be modelled using a ladder-type network of concentrated 
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Lumped-element equivalent circuit of a transmission line. 


Generator L’ L’ Load 


Lossless transmission line loaded with additional shunt capacitances. 


elements, such as shown in Figure 5.51. The line is characterised by its distributed 
inductance L’, capacitance C’, and the distributed series (R’) and shunt (G’) losses. The 
characteristic impedance Zo and the propagation constant y of the line are then: 


R' + joL! 
7 a ea (5.179) 
G'+ JaC’ 


y = JV(R' 4+ joL’)(G! + joC’). (5.180) 


Note that the propagation constant y= a+ j6, where a is the attenuation constant and 6 
is the phase constant. In many cases, the losses can be neglected (R’ « wL', G’ « wC’) 
and we obtain the simple relationships: 


L’ 
Zo) (5.181) 
B=oVL'C’. (5.182) 


This opens up a fundamental idea: any capacitance to ground can be made to disappear if 
itis absorbed into a transmission line — it will simply lower the characteristic impedance, 
and increase the phase constant. 

Consider Figure 5.52. The lossless transmission line is loaded by additional shunt 
capacitances C,. The transmission line parameters are now 


L’ 
c+ 


peoit (c + a), (5.184) 


The parameter / is the length of the transmission line segment between each shunt 
capacitance. 
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Distributed amplifier concept using FETs in common-source configuration. 


Provided that L’ and C’ are chosen in such a way that Yg = Y, = Z, or the trans- 
mission between generator and load is unaltered by the presence of the additional shunt 
capacitances! 

This observation is not new at all. Its earliest implementation is in the Pupin coils, 
periodically inserted series loading coils (increasing L’ in our example) which compen- 
sate for the capacitance to ground of telegraph and telephony lines. They were invented 
in 1894 by Serbian physicist Mihajlo Idvorski Pupin, following earlier suggestions by 
Oliver Heaviside in 1893. 

Of course, the LC combination also acts as a low-pass filter. The frequency 


1 
@Brage = 
LJU (C+ i) 


is called the Bragg frequency of the transmission line structure. The length / must be 
chosen such that the Bragg frequency is significantly above the intended frequency of 
operation. 

Distributed amplifier structures using electron tubes were first described by 
W. S. Percival in his 1937 patent [30]. 


(5.185) 


General design procedure 

We will now apply the concept to an arrangement of FETs in common-source configura- 
tion along two transmission lines, connecting the inputs and outputs, as shown in Figure 
5.53. Note that the transmission lines at input and output have different inductance and 
capacitance per unit area. The loading capacitances are now the imaginary parts of the 
input and output admittances of the common-source gain cells. Using Equations (5.109) 
and (5.111) and a simplified FET equivalent circuit, we write for the shunt capacitance 
loading the input line: 


C1 = Cos + Cop (: ‘ a.) (5.186) 
2¥> 
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provided that the output transmission line is terminated in its characteristic admit- 
tance Yo. 
The shunt capacitance loading the output transmission line is 


Co = Cps + Cop (1 4 su.) (5.187) 
2Yo 


where Cpg is the parasitic drain—source capacitance. 
The unloaded input and output transmission lines must be chosen such that 


e the loaded characteristic impedances correspond to generator and load impedances 
and 

e the phase delays between corresponding nodes on the (loaded) input and output lines 
are equal. 


Assuming identical generator and load impedances, Zg = ZL = Zo, we find 


Li 
Zi1= | ——~=2Zo (5.188) 
Ca 
\ 1 I; 
Ly 
Z2= | ——~ =2Zo. (5.189) 
Ch+% 
V2 76 
The phase synchronism requirement translates into 
Bil = polo 
Cc Cc 
| Li (<i + 1) =h- {LL (c+ 2). (5.190) 
1 2 


The difference in the unit amplifier cell input and output capacitances may result in 
very different design parameters for the input and output transmission lines. Figure 
5.54 shows this in a practical example. The distributed amplifier shown was fabricated 
in an experimental Si/SiGe HFET technology [1]. The transmission lines are realised 
in coplanar waveguide form. The difference in geometry for the input (gate) and output 
(drain) lines is clearly visible. 


Output line 


a : ” Out 


: { ' : 
In Ties _ ' ey ' : : ' 
Input line <a 


Chip micrograph of a distributed amplifier with 32 GHz bandwidth, realised in a Si/SiGe HFET 
technology (P. Abele, I. Kallfass, M. Zeuner, J. Miiller, Th. Hackbarth, D. Chrastina, H.v.Kénel, 
U. Konig, and H. Schumacher, Electronics Letters, Vol. 39, pp. 1448-1449, 2003. © 2003 
TEEE). 
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Distributed amplifier with bias arrangement. 


The terminating impedances for the input and output lines are placed off-chip in 
this example — which brings us to a general problem we did not address so far. The 
distributed amplifier concept in Figure 5.53 did not include the bias arrangement. If we 
apply a gate voltage to the input and a drain voltage to the output line, a constant current 
would flow through the terminating impedances attached to the ends of the transmission 
lines opposite to the input and output ports — resulting in generally unacceptable power 
dissipation there. The terminating impedances therefore need to be galvanically iso- 
lated from the transmission lines. A more practical schematic for a distributed amplifier 
would therefore look like Figure 5.55. The bias-related elements Cpiock and Lchoke set 
the lower cutoff frequency. If a very low lower cutoff frequency is desired, then the 
on-chip realisation especially of the blocking capacitors may be a significant challenge. 
Lchoke iS generally placed off-chip. 


Gain and loss in distributed amplifiers 
Without any losses, the theoretical voltage gain of a distributed amplifier with n stages 
should be 


Z 
Ay = 8m > (5.191) 


where gm is the transconductance of the individual cell and Zo the characteristic 
impedance of the output line. 

So far, we assumed that the transmission lines were lossless, and that the input and 
output admittances of the unit amplifier cells were purely capacitive. The latter assump- 
tions particularly are too bold, of course, and we need to assess how the resistive parts 
of the input and output admittances impact distributed amplifier performance. 

In most calculations so far, the gate (or base) series resistance was neglected. This we 
will abandon here. For the case of a FET, the input line is then loaded with an complex 
admittance: 
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JoC\ 
7 ee led 5.192 
: 1+ jaCi Ro ( ) 
RgC? Cc 
=? 91 gy : (5.193) 


Ww > 
1+ @?R2C} 1+ @*RZC; 


where Rg is the gate resistance and C; the input capacitance as before. As long as 
w < (RgC})~!, losses due to Rg need not be accounted for, but they will increase 
strongly for higher frequencies. 

For the output line, some attenuation is always present due to the real part of Y 59 in 
Equation (5.111), which is gps in FETs: 


Yo = gps + J@C2, (5.194) 


where C2 is the output capacitance as before. The loss introduced to the drain line is 
hence frequency-independent.* 

When the number of stages, 1, is increased, the power consumption scales linearly. 
However, with increasing n, the losses introduced by the amplifier cells become more 
important and lead to a situation where the gain scales sub-linearly. This introduces a 
practical limitation to the number of stages. For a detailed analysis, refer to Beyer ef al. 
(1984) [4]. 


Distributed amplifier variations 

Matching input and output capacitances 

A common problem in distributed amplifiers is that the amplifier cell input capac- 
itance C; is much larger than the output capacitance C2. In turn, the unloaded 
characteristic impedance of the output line will be significantly smaller than that of 
the input line. This is significant because the dispersion characteristics of the lines 
depend on their geometries — different geometries lead to different dispersions, and 
phase synchronism between input and output lines is increasingly lost with rising 
frequency. 

A simple technique is to increase the output capacitance. This can be done easily 
using a transmission line stub between the amplifier cell output and the output transmis- 
sion line. As the amplifier output shows a reasonably high impedance, the transmission 
line stub acts capacitively when seen from the output line. Figure 5.56 shows this simple 
concept, which is used in many practical amplifier examples. 

The input capacitance can also be lowered by introducing a series capacitance in 
the unit cell input port. This leads to a capacitive voltage division between the series 
capacitor and the input impedance of the amplifier, and hence a reduction in gain, but 
depending on the application, this may be tolerated for the benefit of an increased 
bandwidth. To allow proper biasing, the capacitor must be bridged with a high-value 
resistor which has no influence on the RF performance. The measure is shown in 
Figure 5.57. 


4 The loss due to additional drain (or collector) resistances can be neglected unless they are excessive. 
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(a) (b) 


Distributed amplifier unit cell with increased output capacitance: (a) concept and 
(b) implementation using a transmission line stub. 


Input capacitance reduction using a series capacitor. 


By changing the series capacitance value along the input transmission line (lower 
towards the generator and higher towards the termination), the input voltage across the 
amplifying device can be made equal despite the decreasing signal on the transmission 
line. 


Distributed amplifiers with a cascode cell 

Despite the potential of the distributed amplifier concept to eliminate input and output 
capacitances by embedding them into an artificial transmission line, there are good rea- 
sons to keep input and output capacitances low. One reason is that high input and output 
capacitances force the unloaded characteristic impedances of the lines to be very high — 
the signal-carrying lines then have to be very narrow and will exhibit high ohmic loss. 
Further, a high input capacitance means that the loss due to the gate resistance will start 
to matter at much lower frequencies (see Equation (5.192)). 

Choosing a cascode as the amplifier unit cell is therefore a logical choice. A simplified 
configuration is shown in Figure 5.58. 

We had seen that the cascode gain cell is prone to producing a negative real part of 
the output admittance (see p. 336). Here, this effect may be used with benefit to com- 
pensate for losses on the output line, but amplifier stability has to be carefully checked, 
especially at higher frequencies. 
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Cascode gain cells in a distributed amplifier structure (bias elements not shown). 
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Practical distributed amplifier design using (Al,Ga)As/InGaAs pHEMTs (bias circuitry omitted). 


Practical distributed amplifier examples 

40 GHz bandwidth distributed amplifier using GaAs pHEMTs 

Figure 5.59 shows the schematic diagram of a practical distributed amplifier using a 
pseudomorphic HEMT process [16]. Several of the measures discussed above have been 
taken here. The unit cell has a cascode topology, but additionally the input capacitance 
was reduced using a series capacitor in the input line. The series capacitor is bridged 
using a high-value resistor; the additional resistor to ground at the gate node improves 
gain flatness at low frequencies. 

The gate termination does not have a DC blocking capacitor here, because the gate 
line is held at 0 V — the source resistor Rs; provides the slightly negative gate-source 
voltage. Note the elaborate drain termination. This is rather typical of distributed ampli- 
fiers for fibre-optic systems where a lower cutoff frequency in the kHz range is required: 
a broadband termination is created using several RC networks with staggered time 
constants. The largest capacitor (100 nF in this case) is necessarily placed off-chip. 
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Chip photo of the amplifier shown in Figure 5.59. 
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Frequency response of gain (|521|), and input and output reflection coefficients (|S 11], |S22|) 
of the distributed amplifier in Figure 5.59. 


The design deliberately uses the negative real part of the cascode cell output admit- 
tance to compensate for drain—line losses. Rg2 and Rs improve stability together with 
the transmission line in the source lead of the cascode, which acts as a small induc- 
tor and reduces the cell’s gain with increasing frequency, avoiding instability at higher 
frequencies. 

Figure 5.60 shows the chip micrograph of the distributed amplifier. It has six gain 
stages and is implemented using standard microstrip line technology (the back of 
the chip is metallised). Two adjacent stages share via the connections to ground — 
this requires careful assessment of interstage cross-talk issues, but is very efficient in 
reducing the necessary chip area. 

The experimental frequency response (Figure 5.61), shows a very flat gain up to 
about 40 GHz, where the gain drops sharply. This is a very typical feature of distributed 
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amplification. Another noteworthy feature is the low reflection coefficient for both input 
and output over a very wide frequency range, which is due to the distributed nature of 
the input and output impedances. 

The midband gain is 11dB, the output power at 1dB gain compression (for a 
definition, see Figure 5.72 on p. 372) is 22.6 dBm measured at 20 GHz. 


A distributed amplifier on Si using Si/SiGe HBTs 

The distributed amplifier concept is not restricted to FETs. They can also be realised 
using bipolar transitors or HBTs. In the example used here, the goal is to realise a 
distributed amplifier in a production Si/SiGe HBT process on lossy substrates. 

The latter issue, the lossy substrate (20 Qcem specific resistivity), introduces an addi- 
tional complication because neither standard microstrip transmission lines (which use 
the substrate as the dielectric) nor coplanar waveguides (which would equally introduce 
large substrate losses) can be used. Instead, a thin-film microstrip transmission line 
technique (Figure 5.62) was chosen, which creates the microstrip line entirely above 
the substrate. Here, the signal line was placed in metal 3, while metal | acts as the 
ground plane, shielding the signal completely from the lossy substrate. The thin dielec- 
tric, however, leads to very narrow signal lines for the characteristic impedances in 
question (50-100 &) and strongly increases series resistance losses. 

Furthermore, the input admittance of a bipolar transistor is not purely capacitive, as 
we could safely assume for FETs. Using the hybrid z equivalent circuit of Figure 5.17, 
we can estimate the admittance Y ,, for a bipolar transistor: 


vf I 
Y10,bipolar © <4 ie) (cine + nie) : (5.195) 
, Bp Vr Vr 


where fr is the small-signal current gain in common-emitter configuration, Tg is the 
base transit time, Jc is the collector current in this bias point and Vp = kT/q is the 
thermal voltage. The real part of Y ;. would strongly attenuate the signal travelling on 
the input line and has to be eliminated. 


Passivation Metal 3 


Metal 1 


SiO, inter-metal dielectric ral 


Lossy Si substrate (e.g. 20 Qcm) 


Example of a TFMSL on a silicon substrate. 
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The latter problem can be solved using a common-collector (emitter follower) input 
stage (Equation (5.126)): 
Y 
Y) ~ Yo + — : 
lt 


It is evident that the input admittance is much smaller. Furthermore, we had seen in 
Equation (5.128) that given a capacitive component of Y,, the real part of the input 
admittance becomes negative. This can be used to compensate for ohmic losses on the 
input line, but always bears the risk of instability. 

If a cascode gain cell is chosen, the negative real part of its output admittance can 
equally be used to compensate for ohmic losses on the output line, with the same 
stability caveat. 

Figure 5.63 shows an example of a differential amplifier where all of these measures 
have been taken [33]. It was realised in Si/SiGe HBT technology, with transistors of 
St. fmax = 80 GHz, on a 20 Qcm substrate. 

Three cascaded emitter followers are used in the input to achieve the appropriate low 
input capacitance and negative input conductance. The differential cascode gain cell 
has open collector outputs which connect directly to the output transmission lines. The 
capacitively shunted emitter degeneration resistors in the common-source pair improve 
the bandwidth through a positive gain slope of this stage. 

Note the extensive use of level shifting diodes (transistors with their base—collector 
contacts tied together). This is necessary due to the low collector—-emitter breakdown 
voltages typical of high- fr Si/SiGe HBTs. 

The unusual differential topology solves an additional problem of silicon-based 
MMICs: The absence of through-the-substrate via holes makes low-inductance ground- 
ing highly critical. The differential topology eases packaging by creating an on-chip 
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Schematic of a differential distributed amplifier gain cell using Si/SiGe HBTs. 
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Fig. 5.64 
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Chip photo of the differential distributed amplifier. 


ground, as already discussed. In wideband amplifiers, it is not suitable for all system 
architectures, however, due to the need for ultra-wideband baluns. 

Figure 5.64 shows the chip micrograph of the structure. The chip size is 1.7 x 
0.7 mm?. The narrow width of the thin film Microstrip line is very apparent. The 
differential gain is 13.6 dB and the —3 dB bandwidth is 32.2 GHz. 


Low-noise amplifier 


A very frequent requirement is the design of an amplifier with minimum noise figure — 
an LNA. This is especially important in weak signal reception environments such as in 
satellite receivers. 

We have seen earlier that the noise figure of any two-port depends on the source 
reflection coefficient presented to it (see p. 310). The parameters needed for noise- 
optimum design are 


(i) the noise-optimised source reflection coefficient for which the two-port noise figure 

is minimal: Is opt; 

(ii) the minimum noise figure Finin which provides the two-port noise figure under the 
condition that the source reflection coefficient is the noise-optimised one: 's = 
T's opt; 

(iii) the normalised equivalent noise resistance r,, which describes the sensitivity of the 
noise figure F' on deviations from the noise-optimised source reflection coefficient 
V's opt 

Using these parameters, the noise figure is given by 

2 
4rn [T's _ T's ,opt| 
(1 _ [C's|?) |1 Ts opt| 


In practical two-ports using active devices, the noise parameters are also bias-dependent. 
Of particular interest is the dependence of Finin on the drain or collector current. 
Qualitatively, it is shown in Figure 5.65. 

An additional aspect needs to be considered — while in principle any reflection coef- 
ficient || < 1 can be transformed into any other using reactive matching networks, 
practical limitations need to be considered. If the end points of the transformation are 
located too far apart, the resulting matching network will either be very narrow band 
(if the reacting matching elements are sufficiently low loss) or introduce significant 


ferns (5.196) 
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Fig. 5.66 


High-Speed Electronics and Optoelectronics 


\* 


Drain or collector current | ———» 


Finin ———> 


Fopt iy 


Qualitative dependence of the minimum noise figure on the source or collector current. 
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Noise matching example using device scaling and impedance transformation. 


additional losses, which deteriorate the noise figure according to Friis’ formula. For 
LNA design, this means that Is op: should be suitably located. I's opt can be changed by 
changing the device width (‘scaling’) — a larger device width results in larger values of 
Y S,opt- 

For a better understanding, refer to Figure 5.66. We assume that the original source 
reflection coefficient I's, e.g. the feed point impedance of an antenna at resonance, is 
real, and the corresponding impedance is equal to the normalising impedance of the 
Smith chart, hence I's = 0. The original noise-optimised reflection coefficient I's opt 
is located too far towards the outside of the Smith chart. By choosing a larger device, 
I's opt is achieved in a location which is much closer to I's. In fact, this location is ideal 
because the transformation from I's to iw opt C40 be achieved conveniently using only 
a series inductance. 
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The fundamental design steps of the LNA’s input stage are hence the following: 


(i) Pick a suitable device size which puts Is opt into a convenient location with respect 
to the original source reflection coefficient I's. 
(ii) Adjust the bias point such that the optimum Fymin is achieved. 
(iii) Design the input matching network. 


Because the bias point affects I's opt, a few iterations may be necessary. 

In principle, matching for optimum noise performance (I's = Is,opt) and matching 
for optimum power transfer at the input (M's = I) are unrelated. A frequent require- 
ment, however, is the combination of optimum noise performance and a minimum return 
loss, hence i * T's opt. This cannot be achieved using impedance transformation net- 
works between the source and the LNA input, because that would modify I's opt for the 
resulting two-port and Tin in the same way. Instead, Ij, can be modified in two ways 
which leave Is opt invariant: 


(i) through lossless feedback; 
(ii) by mismatching the output for non-unilateral two-ports, utilising the fact that the 
input reflection coefficient depends also on the load reflection coefficient: 


So) Sp2V 


Tl, = §j4.———_. 
in a 


Figure 5.67 summarises the individual reactive networks surrounding the LNA core, 
which can be used in the design to fulfil noise and return loss specifications. 

The feedback elements Z, (series or current-voltage feedback) and Zp (parallel or 
voltage—current feedback) are used to set Fi, while leaving I's op invariant, as dis- 
cussed. M1 provides noise match or, after suitable modification of Tin using feedback 
techniques, simultaneous noise and power match (minimum noise figure and minimum 
input return loss). M2 can be chosen either to present the needed I’, to the LNA core 
for adjustment of Ij, (see above), or to achieve power match at the output (minimum 
output return loss). 

Very commonly, Za is an inductor. As already shown in Equation (5.141) this pro- 
vides an increased real part of the input impedance. Consider the case depicted in 
Figure 5.68. 
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Zn 
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Matching and feedback networks in LNA design. 
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Simultaneous noise and power match example using inductive series feedback and an input 
matching network M1. 


Without any feedback or matching network, the input reflection coefficient is Tin, 
corresponding to an input impedance of Rin — J Xin. The location indicated in the exam- 
ple would be typical for a FET. The goal is now to transform Ij, to a new location 
mM» rs ee We connect an inductor L in series to the LNA core. Applying Equation 
(5.141), we find for the input impedance of the LNA core with feedback: 


Zi, = Rin + orl + J (ol — Xin). (5.197) 


On the Smith chart, the transformation path corresponding to the effect of L can be 
interpreted as first increasing the imaginary part, starting from Ij, and then increasing 
the real part, as shown in the lower part of Figure 5.68.° 

In a second step, matching network M1 (which in the example is simply a series 
inductor) transforms both ra and I's opt towards I's, achieving the required simultane- 
ous optimisation of noise and input return loss. 

For the LNA core, cascode stages (see p. 333) are very frequently being used at 
microwave frequencies. This is because the aforementioned scaling, placing I's opt in an 
‘easily matchable’ location results frequently in relatively large transistors, where the 
Miller effect (discussed on p. 318) can be significant — adoption of a cascode topology 
is a proven way to reduce the increased input capacitance associated with the Miller 
effect. 


5 As an aside, you may notice that with increasing L, Fin decreases — the associated gain Gass, however, 
also decreases. The entity invariant to reactive feedback is the noise measure M = Fpin/ (1 - Gas): 
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First stage of a three-stage LNA for 24 GHz using Si/SiGe HBTs. 


Layout of the three-stage 24 GHz LNA. 


AS a practical example, we will discuss a three-stage LNA for 24 GHz using Si/SiGe 
HBTs [35]. The schematic of the first stage is shown in Figure 5.69. 

Transistors Q2 and Q3 form the cascode LNA core; Q1 forms a current mirror with 
Q2 to set the latter’s collector current. Q3’s base voltage is then set using the voltage 
divider R4/RS. All capacitors are large-value bypass capacitances. 

Inductor L1 is used to allow simultaneous noise and power match along with the 
proper sizing of Q2. There is no on-chip inductance in series with the Jn port, because 
the bond wire is used instead, efficiently including this parasitic into the design. L2 
forms, together with the capacitance between the collector Q3 and ground and the input 
capacitance of the following stage, a parallel resonance which provides the LNA with a 
bandpass characteristic. 

The other two stages are identical in topology, but due to the different source 
impedances of the preceding stages, the inductive source degeneration of the common- 
emitter transistor (Q2) is adjusted. 

Figure 5.70 shows the layout of the three-stage amplifier. Note that the source degen- 
eration inductors are constructed as two-layer stacked inductors, while the tank circuit 
inductors (L2) are conventional spiral inductors. 
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Gain and noise performance of the 24 GHz LNA using Si/SiGe HBT technology. 


The circuit was realised in a Si/SiGe HBT technology with ft, fmax = 80GHz 
and characterised on wafer. Results of a small-signal characterisation are shown in 
Figure 5.71. The circuit shows the targeted bandpass performance with the gain peak 
at 24 GHz (the intended application is in the 24 GHz ISM band). The minimum noise 
figure at 24 GHz is 5.6 dB, while the 50 & noise figure is slightly below 6 dB. This devi- 
ation is not surprising, as the circuit was designed to provide optimum noise figure with 
the bond wire parasitic included. The gain with a 50 Q source impedance is Gsg = 
21.4 dB, while the associated gain under noise match conditions is Gass = 22 dB — in 
this circuit, the noise-optimised source impedance is actually slightly closer to 50 Q 
than the input impedance. 


Amplifier linearity 


So far, we treated amplifiers as perfectly linear systems — the output signal can always 
be described as a linear combination of the input signals. In reality, however, any cir- 
cuit including active devices will show a non-linear behaviour and the assumption of 
linearity holds only for small deviations around a given operating point. 

In practice, the non-linear behaviour of amplifiers will generate nonlinear distortions, 
which create non-linear deviations in time-domain signal shape, and additional spectral 
components in the frequency domain which have to be reckoned with. 


Single-tone excitation 
A common way to treat general non-linear functions is the Taylor series expansion. A 
non-linear function f(x) is expanded around x = xo as 


[ee 


(vy) 
oS Ly — x0)”, (5.198) 


v=0 


where f (xq) is the v™ derivative of f with respect to x in x = xo. 
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Now assume that we apply a single sinusoidal signal to our non-linear system: 
a(wt) = Ag+ Asin(a@t). 
The output signal f(wt) can now be described by the following Taylor series 
expansion (xo = 0): 
f (@t) = koAo (5.199) 
+ kj sin(wt) 
+ ko sin?(ot) 
+ k3 sin? (ot) 
FE saves 


The first two lines in Equation (5.199) provide the linear response, while the following 
terms are non-linear distortions. Consider that 


1 
sin’ (wt) = 5 [1 —cos2or] 


1 
4 [3 sin(wt) — sin(3@t)], 
and we find that Equation (5.199) turns into 


sin? (wt) = 


f(ot) = koAo + “ (5.200) 


3k 
+ (« + =) sin(wt) 


© sesaest) 
— — COS 
2 (69) 


k3 in(3ot) 
— — sin 

4 ra) 
+.... 


We easily see that the non-linear distortion results in new spectral components (har- 
monics) being generated, which are related to the fundamental components as integer 
multiples. 

A simple procedure to assess an amplifier’s linearity is the single-tone excitation test. 
A test generator with high spectral purity and adjustable power is connected to the 
input of the amplifier, and a spectrum analyser to the output. Increasing the input power 
(Pin), the power of individual spectral components at the output is recorded. Plotting the 
output power levels as a function of the input power on a double-logarithmic scale, we 
obtain a graph similar to the one shown in Figure 5.72. 

For low power levels, the output power of the fundamental spectral line will increase 
linearly with the input power. Gradually, it will, however, rise more slowly — gain sat- 
uration sets in. When the power ratio between the extrapolated linear increase and the 
actual curve is | dB, the 1 dB compression point (P_jqp) has been reached. It is a mea- 
sure of the maximum power the amplifier can deliver in linear operation. Depending 
on the application, it is referred to the input (e.g. LNAs) or the output (e.g. power 
amplifiers). 
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Single-tone excitation test of an amplifier, showing definitions for the 1 dB compression, 
second-order intercept and third-order intercept points. 


The spectral power at the second harmonic increases twice as fast as the fundamental 
power, before it also shows saturation. Extrapolating the curve at low input powers, we 
find the single-tone second-order intercept point at the point where the extrapolation 
intersects the extrapolated fundamental power. 

The spectral power at the third harmonic increases three times as fast as the fun- 
damental power. Its extrapolation intersects the extrapolated fundamental power at the 
single-tone third-order intercept point. 

The intercept points can also be referred to the input or the output, depending on the 
application. 

In many applications where the operational bandwidth is only a small fraction of the 
carrier frequency, the generation of harmonics is not necessarily a problem, because they 
can easily be removed by filtering. For example, frequency modulated (FM) transmitters 
are operated under strongly non-linear conditions (class C, see p. 377), and the resulting 
harmonics in the output signal are simply removed by low-pass filtering. 


Two-tone excitation 

An FM signal is a particularly simple example of modulation, because the resulting 
signal has only a single spectral component (which varies in frequency, but that is irrel- 
evant here). Most modulated signals, however, consist of many spectral components 
which are present at the amplifier input simultaneously. 

To understand what amplifier non-linearity will do to these signals, let us construct a 
simple experiment, where the input signal is formed by two spectral components (at @ 
and w2) of equal amplitude, applied to the input of the amplifier. The output is connected 
to a spectrum analyser again. The corresponding block diagram is shown in Figure 5.73. 

Again, the test generators need to have very high spectral purity. 
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Schematic representation of a two-tone excitation test of an amplifier (DUT = device under test). 


Mathematically, the description of the distorted output signal becomes much more 
complex. We obtain: 


e Fundamental components at @, and w2. 
e Harmonics of the input signals (at 2@1,2, 3@1,2,...). 
e Components due to the product of the two input signals — consider that 


sin(w ft) sin(w2t) = ; {cos [(w — w2)t] — cos[(@, + @2)t]}. (5.201) 


The multiplication term therefore produces spectral components at the sum and 
difference of the two input spectral lines. These components are called two-tone 
second-order intermodulation products. 

e Components due to the product of a fundamental component and a second-order 
harmonic 


1 
sin? (@ 1) sin(@2t) = 5 [sin(w2t) — sin(2@\t + wot) + sin(2@\t — wet)] (5.202) 


1 
sin? (@f) sin(@,t) = 5 [sin(w ft) — sin(2@.t + w,t) + sin(2a@2t — w,t)]. (5.203) 


These terms hence generate spectral components at 2@1,2 + 2,1, which are called 
two-tone third-order intermodulation products. 
e Higher-order components which are neglected here. 


Figure 5.74 schematically shows the spectral components generated by non-linear dis- 
tortion of a two-tone signal, up to the third order. Note that second- and third-order 
harmonics as well as the second-order intermodulation products are significantly far 
away from the original signal and can most often be removed by filtering. Of particular 
concern are two third-order intermodulation products at 2m; —q@2 and 2m2 —@ , because 
they are close to the original spectral components and cannot be removed by filtering. 

Just as in case of the single-tone excitation, we can plot the output powers at the fun- 
damental tones and the close-in third-order intermodulation components as a function 
of the input power. Figure 5.75 shows an example of such a measurement. The two- 
tone third-order intercept point is found by extrapolating the low-power portions of the 
curves, where the relationship between input and output powers has a linear shape on a 
double-log plot. It can be referred to the input or the output. 

Often the system requirement will be formulated in terms of the intermodulation 
distance (IMD) or the dynamic range, not in terms of the intercept points. 
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Schematic representation of spectral components generated from a two-tone excitation through 
second- and third-order non-linearities. 
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Determination of the two-tone third-order intercept point. 


The IMD, measured in a two-tone excitation test, is simply the power ratio between 


the power level of the two carriers at the amplifier output and the highest inter- 
modulation spectral lines. The most prominent ones will typically be third-order 
intermodulation products. Then, the IMD can be calculated from the third-order inter- 
cept point. Consider again Figure 5.75 and remember that on the double-log scale, the 
Pout = f (Pin) transfer curve for the fundamental component has a slope of 1, while it 
is 3 for the third-order intermodulation products. The distance between the curves for 
the fundamental and the intermodulation products is the IMD on the log-log scale. The 
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Determination of the SFDR and the BDR in a two-tone excitation measurement. 


powers are expressed most often in dBm° on a logarithmic scale. Therefore, 


IMD | /IIP3 Pin 
dB \dBm_ dBm)’ 


(5.204) 


where IIP3 is the third-order intercept point referred to the input. 
On a linear scale (powers in W), the IMD can be expressed as 


IIP3 \* 
IMD = (5.205) 
Pi 


Specification of an amplifier in terms of dynamic range combines linearity and noise. 
There are two definitions, which are compared in Figure 5.76. 

The spurious-free dynamic range (SFDR) is the IMD at the point where the power 
of the third-order intermodulation products is equal to the noise floor. The blocking 
dynamic range (BDR) is the distance between the 1 dB compression point P_jqp and 
the noise floor. 


Adjacent channel power ratio 

Modern communication systems have frequently very complex modulation schemes, 
with many spectral components present. They are, therefore, very sensitive to inter- 
modulation effects in non-linear amplifiers. A two-tone measurement can only give an 
indication of linearity, but is no solid proof of the amplifier’s suitability. 

A very realistic test is the ACPR (adjacent channel power ratio) test, which is always 
specific to a certain modulation technique. Figure 5.77 shows an example for a UMTS 
signal. In a first step, the integral powers within the channel bandwidth need to be cal- 
culated from the spectral analysis. Then, the ACPR is calculated as the ratio of the 
power in the band of operation to either the lower or higher adjacent channel. It is a 


© dBm means decibels relative to 1 mW, i.e. 0dBm = 1 mW, 20dBm = 100 mW, etc. 
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Example of an Adjacent Channel Power ratio (ACPR) measurement: Power spectral density 
(PSD) versus frequency, with channel limits indicated. 


direct measure of the interference generated by transmitter non-linearities in adjacent 
channels. 


Power amplifiers 


Power amplifiers have the task of amplifying signals before they are delivered to loads, 
such as antennas or cables. Critical criteria are 


e maximum output power, for example measured in terms of output power at the | dB 
compression point P_; ap (see Figure 5.72); 

e gain (either small-signal gain or large-signal gain at a given output power); 

e gain and potentially phase deviation across the operational bandwidth; 

e linearity, defined by parameters such as the output-referred two-tone third-order 
intercept point, the IMD at a given output power, or the ACPR at a given output 
power; 

e efficiency — at microwave frequencies, it is customary to use the power added effi- 
ciency (PAE), the ratio of the power difference between output and input to the DC 
power: 


Pina? 
PAE = 4" = (1 = <) (5.206) 
Ppoc 


where 7 is the collector or drain efficiency (7 = Pout/Ppc) and G the amplifier gain. 
Classes of operation 


Since the days of vacuum tubes, amplifier operation has been described by classes, 
which describe where the amplifying devices are biased in a quiescent state. 
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HEMT drain current /p and transconductance gy as a function of the gate-source voltage Vgs 
with bias points for power amplifier classes A, B and C indicated. 


For an understanding of the ‘classical’ classes A, B and C, refer to Figure 5.78. 
The example shows the drain current and transconductance of a HEMT. For the 
classification, we observe the drain current curve. 

In a class A amplifier, the gate-source voltage Vgg is set in the region where the 
output current /p is a linear function of the input voltage Vgs — the transconductance is 
approximately constant. For both positive and negative half-waves of the input signal, 
current will flow — the conduction angle is 360°. In this bias point, the amplifier will 
exhibit a very high linearity, but low efficiency. The theoretical maximum is 50%, but 
at microwave frequencies, values of 30% would already be very satisfactory. 

For class B, the device is biased at pinch-off. Only the positive half-wave of the input 
signal will then generate an output current flow — the conduction angle is 180°. The 
efficiency will increase theoretically to 78.5% (2/4), and at microwave frequencies it 
can still reach 60% or higher, but the deviation from a sine wave in the output current 
creates non-linear distortions. 

A class C amplifier has a quiescent bias point where Vgs is significantly below the 
threshold voltage Vi,. Output current will flow only if the momentary Ves(t) > Vin, 
therefore the conduction angle is <180°. The efficiency can still be higher; however, 
due to the lower conduction angle, the non-linear distortions are also increased. 


Switched amplifiers 

There is another interpretation of class C operation, which is helpful in understanding 
the way that amplifiers in class D, E and F operate. For this, look at Figure 5.79. It 
shows the output I-V characteristics of a HEMT (but this could be any FET). Provided 
that the driving voltage is large enough, the transistor simply changes between two 
saturated states with very different differential resistances. In the quiescent point, the 
transistor is in cut-off and the differential resistance between drain and source is very 
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Saturated class C operation in the output I-V characteristics of a HEMT. 
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(a) Simple class C amplifier topology and (b) its equivalent circuit. 


high. For a sufficiently high input voltage, the transistor reaches another saturated state 
with small rps. 

We can, therefore, model the transistor in saturated class C operation simply by a 
switch in series with its residual differential resistance rps. Figure 5.80(a) shows a sim- 
ple class C amplifier stage. The load is embedded in an LC resonant circuit which acts 
as a bandpass filter to suppress the harmonic frequency components other than the fun- 
damental. The RF choke (Lchoke) provides a constant current, at least on the time scale 
of interest. This circuit can also be realised with an LC parallel resonant circuit, by 
the way. 

Replacing the choke with a constant current source, and the FET with a periodically 
actuated switch and its series resistance rps, we arrive at the equivalent circuit in Figure 
5.80(b). The class C amplifier operates in this configuration by periodically shunting 


current away from the load. 
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Class C amplifier: (a) power factor a and (b) drain efficiency f as a function of frequency. 


A detailed analysis of class C operation is found in [17]. First, note that due to the RF 
choke, the average voltage across the load is the supply voltage Vpp. The peak voltage 
is (1 + @)Vpp and the minimum voltage (1 — a) Vpp, where 


w(6) = 4 sin (5) 


= —— (5.207) 
6 + sin(@) + a 


Here, @ is the conduction angle. Note that for rps — 0, @max = a(@ = mw) = 1.27- 
the maximum voltage across the transistor can, therefore, exceed the supply voltage by 


a factor of 2.27. 
The output power in saturated class C operation is 


(@Vpp)” 
2R, 


(5.208) 


Pout = 


The drain efficiency is 
rDS a? 


5.209 
RL 6 — 2a sin ($) as 


n@)=1 


Both the power factor a and the drain efficiency n are shown in Figure 5.81 as a function 
of the conduction angle @. Note that the output power always peaks at 6 = zr, but that 
the efficiency has its maximum at much lower conduction angle. The normalised on- 
resistance of the FET, rps/ Rt, has a significant influence on both the output power and 


the drain efficiency. 


Class D amplifier 
Above, we interpreted the class C amplifier as a resonant circuit driven by current 


pulses, where for maximum efficiency the current flow angle was 6 < z. We can, 
of course, not only turn the current on and off, but actually reverse it, as shown 


schematically in Figure 5.82. 
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Example implementations of class D amplifiers. After [17]. 


Instead of a single-pole, single-throw switch, the equivalent circuit shows a double- 
pole, double-throw switch which periodically reverses the current through the load. The 
parallel resonant circuit again eliminates all harmonic frequency components except the 
fundamental one. 

In practice, the switches are realised with transistors, of course. Figure 5.83 shows 
two examples. In Figure 5.83(a), the current reversal is achieved using a transformer 
where the current is fed into the centre tap, and the ends of the primary coil are 
connected alternatingly to ground. This is a very common solution at lower frequencies. 

At microwave frequencies, the transformers are difficult to realise, and in any case 
they do not lend themselves easily to monolithic integration. The circuit in Figure 
5.83(b) is then more practical — it avoids transformers altogether; however, now we 
have four transistors to control instead of two: in this bridge configuration, transistors 
QI and Q4, and Q2 and Q3 conduct alternately to achieve the current phase reversal 
across the load. 
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Class D amplifier example using a series-fed load. After [17]. 


Note that in both implementations, the load floats — it has no direct ground reference, 
which is often a problem for microwave systems where ground-referenced (single- 
ended) transmission is more common. This can be avoided in a class D amplifier if 
a series-fed load is applied. This is shown in Figure 5.84. The bandpass function is now 
realised with a series resonant circuit (Lo, Co), and the voltage is alternated, not the cur- 
rent. Still a balun is needed at the input, unless V(t) is already available in differential 
form. 

The class D amplifier output power is in saturation [17]: 


2V 
Pot = — LE, 5.210 
out m2R,, ( ) 
while the drain efficiency is 
Ri 
= ———— 5.211 
7 Rt rDs 


at least for the circuits according to Figures 5.83(a) and 5.84. rps is again the channel 
resistance of the FET for low Vps (‘on-resistance’). For the circuit in Figure 5.83(b), 
the efficiency is lower because the switch resistance doubles. 

Class D amplifiers place high demands on the ideality of the switches and on the 
timing. This is particularly true for circuits such as in Figure 5.83(b) or 5.84, where 
switches are stacked — they must never conduct at the same time, not even for brief 
periods. Therefore, class D amplifiers are mostly restricted to lower RF frequencies. 


Class E and F amplifiers 

Class E and F amplifiers are derived from class C. The idea in a class E amplifier [38] is 
to make sure that the drain-source voltage of the switching transistor (see Figure 5.80) 
is zero when the transistor switches, leading to a reduction of losses due to capacitive 
charging. This can be achieved by a modified output network. Consider Figure 5.85. At 
first glance, it looks like a class C amplifier with a series resonant feed, with the addition 
of shunt capacitor C2. Additionally, the series resonant filter is tuned to a frequency 
below the intended frequency of operation wo by increasing L1. Adjusting L; and C2, 
the voltage-free switching condition is found and the efficiency is maximised. 
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Simplified class E amplifier schematic. 


An in-depth analysis of class E operation can be found, e.g. in [8]. The inductance 
L, is chosen as 
le 1+ 1.153@0C1 RL 


5.212 
I Tah ( ) 
The capacitance C2 is 
2 
© = ——: 5.213 
* ~ 3.46Toom Ri cd 


A problem may be that the maximum drain-source voltage is even higher than in the 
class C amplifier, Vps max © 3.56Vpp. 

The class F amplifier increases the efficiency by appropriately terminating the har- 
monics. The idea is to achieve square-wave voltage excitation with respect to the 
drain-source voltage, as in case of the series-fed class D amplifier (Figure 5.84). 

A well-known fact from Fourier analysis is that a square wave (rectangular sig- 
nal with 50% duty cycle) in the time domain produces only odd harmonics in the 
frequency domain. We must, therefore, make sure that all odd-numbered harmonics 
(n = 1,3,5,...) are still present in the drain-source voltage. The load should, therefore, 
present an open to the transistors at these frequencies. 

A way to achieve this is to use the transforming properties of quarter-wave transmis- 
sion lines, which we discussed much earlier; see Equation (5.171) on p. 349. Assume 
that a transmission line, which is a quarter wavelength long at the fundamental fre- 
quency, is terminated by a short at all harmonic frequencies except the fundamental. 
Then an open will appear at the input for all odd-numbered harmonics, while a short 
results for all even-numbered harmonics, where the electric length of the transmission 
line is a multiple of 1/2. The modification of the original class C topology (Figure 5.80) 
is quite straightforward, as Figure 5.86(a) shows. A is the wavelength at the frequency 
of operation wo. The resonant circuit formed by Lo, Co is resonant at wo; Co then acts 
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Topology of a class F amplifier: (a) using a quarter-wave transformer and (b) using a parallel 
resonant trap tuned to the third harmonic. 


as a short at the higher-order harmonics. This short is transformed into an open by the 
quarter-wave transformer at all odd-numbered harmonics. 

In monolithic integration, the transmission line transformer is frequently much too 
long, and it may introduce significant losses. In many cases, it is perfectly acceptable to 
just use the third harmonic. This is shown in Figure 5.86(b). Here, a simple parallel LC 
circuit blocks the third harmonic (39), while it acts as a short for all other harmonics 
and the fundamental frequency. 


Oscillators 


Oscillators are crucial components in almost any microwave system. Their fundamental 
task is to generate AC energy at a well-defined frequency from DC sources. A typical 
use of an oscillator is shown in the generic receiver block diagram of Figure 5.87, where 
it converts the input signal to a lower intermediate frequency. The mixer circuit, which 
is also needed for the frequency translation, will be discussed in the next section. 

The class of oscillators discussed here has three aspects in common: 


(i) a resonator to set the frequency of oscillation, 
(ii) the generation of instability to allow the onset of oscillation, and 
(iii) amplitude control to establish a stable amplitude in steady state. 


Simple relaxation-type oscillators, such as found in simple digital timing circuits, will 
not be covered here. 


Resonators — a brief overview 


The resonator has the task of setting the oscillator’s natural frequency. 
The most common resonator types are 


e lumped-element LC resonators, which come in either series or parallel resonance 
forms; 
e transmission-line resonators; 
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e cavity resonators using waveguide elements; 


Generic receiver block diagram. 


e dielectric resonators, which use high €, ceramics and are typically combined with 
transmission line coupling structures; 

e quartz crystals and similar devices which use the piezoelectric effect to derive elec- 
trical from a mechanical resonance — surface acoustic wave (SAW) and bulk acoustic 
wave (BAW) resonators also fall into this category. 


Other resonator types, such as the magnetically tuned YIG (yttrium iron garnet) 
resonators, have only very limited use in speciality applications. 
Critical aspects for resonators are 


e the quality factor, which will be discussed in more detail below; 

e the reproducibility of the resonant frequency — this can be a significant problem in 
BAW and SAW resonators; 

e the stability of the resonant frequency against changes in temperature, mechanical 
shock and aging; 

e the tunability of the resonant frequency — mostly established using variable capaci- 
tance elements. 


For fixed-frequency oscillators, quartz crystals can still be considered to be the gold 
standard. Replacement of quartz resonators by elements which can be monolithically 
integrated is highly desirable and a hot research topic. 


Quality factor 
A very generic definition of the quality factor compares the stored and the dissipated 
energy in the resonator [20]: 


stored energy in the resonator 


O=2 (5.214) 


a, : > 
dissipated energy during one cycle 
for @ = wo. 
Let us consider RLC resonators (Figure 5.88) as an important example — via equiva- 
lent circuits, other resonator types can be transferred into RLC type resonators, at least 
in the immediate vicinity of the resonant frequency. 
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RLC resonator circuits: (a) parallel topology and (b) series topology. 


For the parallel resonant circuit (Figure 5.88(a)) at resonance w = wo = 1/V LC, the 
impedance is purely resistive and the dissipated energy during one cycle is 


17?R I?R 
Ediss = om — =>; (5.215) 
2 on 0 


where / is the amplitude of the sinusoidal current flowing through the resonator. 
The stored energy moves back and forth between the inductor and the capacitor; 
therefore, it suffices to calculate it for the capacitor: 


eS ae ee 
Estored = 5CV = 5¢ (ik) : (5.216) 


Inserting Equations (5.215) and (5.216) into (5.214) yields the Q factor for the parallel 
RLC resonator. This quality factor is called the unloaded Q because the loading resistor 
(Ro) has not been taken into account: 


R R 
Qy = wRC = — = —. (5.217) 
woL Es 
Cc 
The loaded Q is calculated by connecting Ro in parallel to R: 
RR 
Q1 = wp —— “i (5.218) 


R+Ro 14+ f° 


Similarly, we can calculate the unloaded Q for the series resonator (Figure 5.88): 


16 
Ou = cook = vé (5.219) 
eR RR? 
and the loaded Q as 
Q 
OQ, = = (5.220) 


Self-excitation criteria 


Early in this chapter, we considered two-port stability from the viewpoint of avoiding 
parasitic oscillations in amplifiers. Now the task is to deliberately create instability in a 
certain frequency range. 
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Connection of a resonator to an oscillator core. A parallel RLC resonator is chosen as an 
example. 


In the above resonator examples, the resonators always had a dissipative element 
associated with it. In practice, this is indeed always the case as resistive and radiative 
losses are never fully avoidable. In terms of the reflection coefficient seen looking into 
the resonator, this means 


Pres] < 1. 
The stability boundary can be written as 


TresPose = 1, (5.221) 


where Fosc is the reflection coefficient looking into the oscillator core. Because the 
reflection coefficients are generally complex entities, Equation (5.221) has to be 
decomposed into a magnitude and a phase condition: 


Pres] + Tose] = 1 (5.222) 

L(Tres) + £(Tosc) = 0. (5.223) 

The oscillator core will therefore necessarily have to provide |Tosc| > 1. Because we 

have seen in the Smith chart discussion that for all Re(Z) = 0,...,00, |F| < 1, 

this means that the real part of the oscillator core input impedance will have to be 
negative. 


Another way of determining the proper conditions for oscillation is to use the 
Barkhausen self-excitation criterion. The block diagram for this discussion is shown 
in Figure 5.90. The system with positive feedback has the transfer function: 


1 


S(w) = 


where F(q@) is the open loop gain. Obviously, the transfer function grows beyond all 
bounds — becomes unstable — for 


F(@) =1. (5.225) 
As F(q@) is a complex function, two conditions need to be fulfilled: 


Re{ F()} = 1 (5.226) 
Im{F(o)} = 0. (5.227) 
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Barkhausen self-excitation criterion: system model with positive feedback. 
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Time-domain simulation of oscillator start-up behaviour. 


Non-linearity in oscillators 


The self-excitation criteria introduced so far assumed that the systems under investiga- 
tion were all linear. In practice, however, this would not lead to the desired result of 
well-controlled sinusoidal signal generation. 

Using the T° criterion (Equation (5.222)), the initial |[osc| should be significantly 
(10-20%) higher than |I’;es| for the reliable onset of oscillation. However, the oscillation 
amplitude would then grow beyond all bounds, or in practice until it is limited by the 
supply voltage. 

Fortunately, the active components we are using in the oscillator core to generate 
the negative resistance all exhibit gain saturation, i.e. the differential gain decreases 
with increasing signal amplitude. This leads to a self-stabilisation of the oscillation 
amplitude. 

Figure 5.91 shows a time-domain simulation of oscillator start-up behaviour using a 
non-linear active device model (here, a Si/SiGe HBT). Note how the device linearity 
leads to a steady-state oscillatory behaviour after only a few cycles. Looking carefully, 
you will also notice that the initial frequency of oscillation is different from the one in 
steady state — this is caused by the reactive component of the oscillator core impedance, 
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which also shows a non-linear behaviour and varies as the amplitude increases. The 
effect, which is undesirable, is frequently referred to as chirp. 


Oscillator topologies 


The negative resistance necessary to fulfil the condition |Fosc| > 1 for the oscillator 
core can be generated in a variety of ways: 


The active devices in the oscillator core could have a negative differential resistance of 
its quasi-stationary I-V characteristics. Examples are tunnel diodes, Gunn diodes or 
exotic devices such as real-space transfer transistors. Except for Gunn diodes, which 
are still being used in inexpensive microwave modules (e.g. motion detectors), they 
have no longer (or never had) any commercial significance. 

The active device could incorporate a transit-time region which leads to a phase lag 
between the applied voltage and the current through the device. If this phase shift is 
larger than 2/2 at a given frequency, the resulting impedance at that frequency has a 
negative real part. An example of such a device is the IMPact ionization Avalanche 
Transit-Time (IMPATT) diode, which is still being used extensively in millimetre- 
wave oscillators. 

The most common way to generate negative resistance is the use of positive feedback 
around an amplifying device. 


The last item will be discussed in more detail here. 

There are several ways of introducing positive feedback around an amplifier. Four of 
them are shown in Figure 5.92. 

The configuration in Figure 5.92(a) uses magnetic coupling in a transformer around a 
common-source (or common-emitter) stage. As the common-source amplifier is invert- 
ing, a reversal of the winding sense in the transformer is necessary to generate the 
required positive feedback. Realised with vacuum tubes, this circuit was known very 
early in the history of radio and is called Armstrong (or Meissner) oscillator. 

The Hartley topology (Figure 5.92(b)) uses a tapped-inductor feedback path around 
a non-inverting common-drain stage. The Colpitts oscillator (Figure 5.92(c)), uses a 
similar concept, but a capacitive voltage divider instead of the tapped inductor — it is 
therefore easier to realise in integrated form, and probably the most popular topology 
for MMIC implementations. When the inductor is replaced by a series LC circuit, the 
Clapp topology results (Figure 5.92(d)). 

Due to its popularity, we will examine the Colpitts topology in more detail 
(Figure 5.93). 

The current through capacitor C2 is 


ic2 = Ugs [gm + o(C1 + Cos) ] , (5.228) 


which leads to the input voltage 


(5.229) 
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(a) (b) 


(c) (d) 


Common oscillator topologies: (a) Armstrong or Meissner, (b) Hartley, (c) Colpitts 
and (d) Clapp. 
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Simple equivalent circuit of the Colpitts oscillator topology. 
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The input current is 
lin = Ves JO(C1 ca Cys). (5.230) 
This allows us to calculate the input impedance of the oscillator core: 


i Ce 
R= = = + te (5.231) 
lin w?C2(Cy + Cgs) go Cy + C2 + Cos 


The negative real part of the input impedance is obvious; the imaginary part is simply 
the series connection of the capacitances at the input. 

Another aspect of the Colpitts oscillator is apparent: especially at high frequencies, 
the capacitor C} is not really necessary, the Cgs of the device suffices. 

A very popular topology, especially for RF CMOS circuits, is the cross-coupled pair, 
shown in Figure 5.94(a). It is a differential pair with the gate of each transistor connected 
to the drain of the opposite transistor. Because of the 180° phase shift between both 
branches, this forces the small-signal gate voltages to be 


Ugs] = —Ugs?. (5.232) 
Remember that in a differential pair under perfectly differential excitation, all nodes 
along the vertical centre plane are virtual grounds. Then the circuit behaviour can be 
completely described using the half-circuit in Figure 5.94(b). To maintain symmetry, 


Co has been replaced by the series connection of two capacitors of 2Co, while Lo is 
divided into the series connection of two inductors with Lo/2. 


Lon Lyf? 
Lo 
C if Cy s2 


—> 


Vgs2 
ais \— 02 aileg 
| ye" 

Vgs1 Vgs2 = 


(a) (b) 


<<) 


(a) Simplified circuit of a cross-coupled pair oscillator and (b) its equivalent half-circuit. 
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Let us now calculate the admittance seen by the resonator towards the oscillator core: 


Vos] + Vges2 J OCos 
me 1 _ &mUgs1 gs2 J gs2 (5.233) 
Ugs? Ugs2 
= —8m + JOCgs2, 


using Equation (5.232). The negative real part is hence —gm, and the oscillation 
frequency is 
1 


Vv Lo(Co + Cos) 


oo = (5.234) 
assuming Cgs1 = Cgso = Cgs. 

Finally, negative resistance can also be generated using common-gate (or common- 
base) configurations. 

In the common-gate amplifier stage shown in Figure 5.95, note the inductance Lo 
inserted into the gate lead. We will calculate the small-signal input impedance for this 
circuit. The input current is 


i] = —Vgs(8m + J@Cgs). (5.235) 
The current through Lo is vgs J @Cgs, So the input voltage vj is (after a short calculation) 

v1 = —Ugs(1 — w*LoCes). (5.236) 
The input impedance is then, after separation into its real and imaginary parts, 


UL mC — w* LoCgs) é 1— wo LiCn 
i ge tercR SF gh + aPC 


Z\= (5.237) 
Now remember that the transit frequency wr © gm/Cgs and assume that 


Kw <« Or. 


LaCa 


RL 


| Coe 
— 
Oscillator core using a common-gate stage. 
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Then, we obtain 


w* Lo w 
Z=- . + Jolo->- (5.238) 
T wo 


T 


Provided that the frequency is larger than the resonant frequency of Lo, Cgs, this circuit 
will, therefore, also generate a negative resistance. The imaginary part is inductive. 


Voltage-controlled oscillators 

Electronic control of the oscillation frequency is a very common requirement. While 
both variable inductance and variable capacitance concepts are in principle feasible, 
variable inductance approaches suffer from poor integrability. Attempts to realise inte- 
grated variable inductors using micro-electro-mechanical structures (MEMS) exist, but 
have not found practical use yet. 

In practical applications, variable capacitors, in turn, are always realised using varac- 
tor diodes, which may build upon p—n junction diodes, Schottky diodes, or MOS diodes, 
depending on the underlying semiconductor technology. In each case, the capacitance 
of a blocking diode structure is varied by changing the voltage across the diode. Vari- 
able capacitors using MEMS have been investigated quite extensively, but again have 
not reached sufficient maturity for commercial applications so far. 

The voltage-controlled oscillator (VCO) example in Figure 5.96 uses the cross- 
coupled differential pair topology introduced in Figure 5.94(a). The fixed capacitor 
has been replaced by two varactor diodes. The tuning voltage is connected to a virtual 
ground point, which facilitates decoupling between the RF and the DC control paths. 

The series resistance of the varactor diodes should not be overlooked — it may 
substantially lower the overall resonator quality factor. The total quality factor can be 
shown to be 


i 
y 
— 


Lo 


| 4 
Viune | | 
PAW 


Q1 Q2 


IH 


VCO using a cross-coupled topology. 
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Oc OL 
QOc+ Qt’ 


ee ae Oc=-— = fot 
i Ry C Ry’ no) oa C Rc 


are the quality factors of an inductor with series resistance Ry, and a capacitor with series 
resistance Rc, respectively, in a parallel resonant circuit. The assumption of weak losses 
has been made: 


Orotal = (5.239) 


where 


ws L 
2 
Rr 


> 1, a2 R2C? «1. 


Noise in oscillators 


The noise phenomena in active devices of course also affect oscillator performance. In 
principle, two things may happen: 


e The oscillation amplitude may fluctuate randomly with time — amplitude noise. 
e The phase of the sinusoidal signal may fluctuate randomly with time — phase noise. 


Of the two, amplitude noise is the least critical. First of all, the gain compression effect 
which leads to a stable oscillation condition in the first place also reduces random 
amplitude fluctuations. Also, in frequency translation applications (frequency up- or 
down-conversion), oscillators typically work into switch-type mixers where the oscilla- 
tor amplitude has little effect on the conversion efficiency, provided it is still sufficient 
for switching operation (see Section 5.6, p. 396). 

For these reasons, we will restrict our discussions to phase noise, which has a much 
stronger impact on system performance. 

Phase noise describes the random fluctuations of the oscillator phase with time. 
Mathematically, 


s(t) = Asin[ot + o(t)]. (5.240) 


In the frequency domain, these phase fluctuations are manifest in noise sidebands close 
to the carrier (see Figure 5.97). The spectral power density of the noise sidebands 
decreases with increasing distance to the carrier. They have a number of detrimental 
effects, for example 


e in transmitters, they lead to interference in nearby channels; 

e in receivers, phase noise increases the perceived in-channel noise due to a phe- 
nomenon known as reciprocal mixing, which is an intermodulation effect between 
a strong interferer and the local oscillator noise sidebands. 


The customary figure of merit for phase noise suppression is the ratio between the car- 
rier power and the power of the noise sidebands in a | Hz bandwidth at an offset Af 
from the carrier. It is typically expressed in ‘dBc/Hz’. 
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Phase noise sidebands around a carrier in the frequency domain. 
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Noise equivalent circuit of an ideal oscillator core with a lossy resonator. 


To appreciate the effect of the resonator on the oscillator noise performance, con- 
sider Figure 5.98. The oscillator core itself is assumed to be noiseless (a strong 
over-simplification). The imaginary part of its input admittance is merged into the LC 
resonator; at the resonant frequency, the oscillator core presents a real part —G which 
just offsets the resonator loss G. 

The resonator dissipative element, G, creates a thermal noise current whose noise 
phasor is (see Equation (5.82)) 


(lini?) — 8kT BG, 


where B is the measurement bandwidth. 

The voltage v is ip - Zres, Where Zyes is the resonator impedance. Because the res- 
onator losses are exactly compensated, it is the impedance of an ideal parallel LC 
resonator: 
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Pi I (5.241) 
Les = TLC | 
% 


Taylor series expansion of the denominator, aborted after the linear term, leads to an 
approximate impedance for small deviations dw from the resonance wo: 


7 JooL wo 1 
Zres(@o + Aw) © F Zhe =) 2Aw OG (5.242) 
using Q = R/w@oL = 1/(@pLG). 
The phasor of the noise-generated voltage v, is then 
; 2 8kTB 
(ivi?) = (iin) |Z,es|_ * —————> (5.243) 


2 
2A 
G Q? ( Aw) 
using the approximation in Equation (5.242). 


Comparing the noise power in bandwidth B to the signal power Ps, finally we obtain 
the phase noise suppression: 


ase wo ) 
L(Aw) = —— ; (5.244) 


Note the quadratic dependence of the phase noise suppression on the resonator Q. 

In practical cases, the oscillator core is not noiseless. Furthermore, we need to take 
low-frequency noise phenomena into account, which lead to a stronger increase in phase 
noise close to the carrier. Leeson (1966) [26] introduced the following semi-empirical 
formula, which builds upon Equation (5.244): 


ae _ 2FKTB ; ay \? ; Qn fe oie 
roe + (5x85) (1+ Fe). re 


The additional factors are F — accounting for the additional noise in the oscillator core — 
and fo, the cutoff frequency for low-frequency (*1/f’) noise. 

Figure 5.99 shows an example calculation using Equation (5.245). We clearly 
distinguish three different regions: 


(i) Close to the carrier, the noise power drops with —30dB/decade. Here, the low- 
frequency noise increase in the oscillator core’s active devices dominate. 

(ii) Further out, the decay is —20dB/decade, corresponding to the earlier calcula- 
tions. Here, white noise sources (such as the thermal noise provided by the lossy 
resonator) dominate. 

(iii) Far away from the carrier, a noise floor is visible, but this is generally not very 
relevant, unless F is very large. 


If low phase noise oscillators are a requirement, both high quality factor resonators 
and active devices with low-frequency noise are a must. In general, bipolar devices 
(including HBTs) will fare much better than FETs. On-chip resonators generally have 
much lower Q than off-chip resonators can achieve. 
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Simulated phase noise sideband of an oscillator using the Leeson equation. 


Mixers 


Mixers are generally frequency-translation components, with a variety of applications 
in analogue signal processing, such as frequency shifting of signals (up and down con- 
version), phase shifting, modulation and demodulation. A special class of mixers — 
four-quadrant multipliers — can also be used in correlators, for example in impulse-radio 
ultra-wideband receivers. 

The mathematics behind mixer operation has been reviewed already in the context of 
non-linear amplification (p. 370 and following). Recall that if we take a signal consisting 
of two sinusoidal components of different frequencies w; and w and feed it into a 
non-linear two-port, the output /(t) can be described by a Taylor series expansion: 


A(t) = ky [asin(ayt) + b sin(w2t)] + ko [a sin(w2t) + bsin(wat)P +... (5.246) 
The quadratic term expands to 
ky [a sin(wt) + b sin(w2t) P (5.247) 
= ky [2 sin2(w,t) + b? sin? (wot) + 2ab sin(wyt) sin(w2r) | 
The product term can be expressed as 
2ab sin(@,t) sin(@2t) = ab {cos[(@, — @2)t] — cos[(@1 + w2)t]}. (5.248) 


Any non-linear system under two-tone excitation will therefore produce spectral compo- 
nents at the sum and difference of the input signals. We also conclude that an analogue 
multiplier would be the ideal mixer, because it only produces the sum and difference of 
the input spectral components. 

Incidentally, the interaction used here for frequency mixing purposes, was called 
second-order intermodulation in the context of non-linear amplifiers. 
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Transconductance multiplier 


A simple analogue multiplier can be realised using a bipolar differential amplifier. It 
utilises the fact that in bipolar transistors, the small-signal transconductance is linearly 
dependent on the large-signal collector current in the operating point. Now take a differ- 
ential pair Q;, Q2 with acommon current source transistor Q3 (Figure 5.100). The RF 
input voltage v, is fed differentially into the top transistor pair (superimposed on the bias 
voltage Vo), while the oscillator current signal, JLo, is fed single-endedly into the base 
of Q3. Let Q3 have a current gain B. Then, the differential output signal is 


BR, 
v2 = V1igmRL = vilLoz—,; (5.249) 
2Vr 
because 
_ Blo 
8m = 2Vr 


The output voltage is therefore proportional to the product of the input voltage and the 
local oscillator current. 

This simple circuit, however, has a number of drawbacks. First of all, it will only 
work for [Lo > 0. Secondly, the input voltage has to be much smaller than Vp (or 
26 mV at room temperature) to fulfil the small-signal assumption. Finally, it will only 
work with bipolar transistors. 


ho 


Transconductance multiplier circuit. 
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Resistive mixer 


After looking at a mixer circuit which works only with bipolar transistors, let us briefly 
consider a circuit which will work only with FETs. To appreciate its mode of operation, 
examine the output I—V characteristics of a FET (see Figure 5.101). 

At low Vps, the relationship between Jp and Vpgs is an approximately linear one 
and can therefore be accurately described by the channel conductance gps. Assuming a 
simple Statz—Curtice model for the FET: 


B(Vas — Vp)? 
Ip (Vas, Vos) = ———- tanh(y Vps), 
pb (Vas, Vos) 1+ a(Vos — Ve) (y Vos) 
where Vp is the pinch-off or threshold voltage and a, 6 and y are model parameters, we 
find 


B(Vas — Vp)? 
1+a(Vces — Vp) 


B(Vas — Vp)? 
1+.a(Vcs — Vp)’ 


[! tanh2(y Vos) | my (5.250) 


§Ds = Y 
provided that tanh? (y Vos) « loryVps < 0.3. 

Now the FET is placed in a circuit as shown in Figure 5.102. The inductor L enforces 

a steady-state bias point Vps,o = 0, but is invisible at the LO, RF or IF frequencies. The 

gate bias can be set to a suitable Vgs for optimum mixer operation. The FET, together 

with the generator resistance of the RF port Rep, forms a resistive voltage divider whose 
division ratio is modulated by the gate potential: 


Vrar(t) 


Vps (t) = ————____. 
oe) 1+ gps(t) RR 


(5.251) 


The periodic variation of gps by the gate potential Ves = Vcos.o + Vio sin(@Lot): 


gps = By [Ves.o — Vp + Vio sin(wrot)]” (5.252) 


———_ 


Vp Ss 


FET output I-V characteristics with indication of variable resistor operation. 
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Fig. 5.102 (a) Resistive mixer configuration and (b) its simplified equivalent circuit. 
results in the desired mixing action (pure square law behaviour, i.e. a = 0, is 


assumed here, see Equation (5.250)). Note that Equation (5.252) only holds for 
Vio < Vas,o — Vp. 

Frequently, the local oscillator voltage will be chosen such that Vio > Voas.o — 
Vp — the channel conductance is then switched between two saturated states, gps = 0 
and a high state essentially limited by the source and drain series resistances, which 
were initially omitted in the simplified discussion. Under this condition, and assuming 
Vear(t) = Ver sin(@ppt), the time-dependent drain-source voltage becomes 

Vere sin(@ppt) 


Vps(t) = —————— [R R R t t)]. 5.253 
ps (t) Rs + Rp + Ren s + Rp + Rerrect(wiot)] ( ) 


Since the Fourier series of the rect function is 


(5.254) 


1 
rect(w_ot) = 7 


+ = sin oro + sinGeLot) +... | : 
a 3 
this switching mode operation leads to the desired multiplication with the fundamen- 
tal frequency of the local oscillator, but also with higher-order harmonics — the latter 
operation is referred to as sub-harmonic pumping. 
Figure 5.103 shows as an example the measured conversion gain of a resistive 


mixer, as a function of the local oscillator power. At low Pio, the conversion gain 
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increases proportionally with the local oscillator power — the mixer acts like a mul- 
tiplier. For higher Pio, the mixer enters the switching regime — the conversion gain 
is roughly independent of the local oscillator power. This behaviour can be seen, 
in variations, for any of the mixer circuits discussed here. In standard wireless sys- 
tems, mixers are operated in the switching regime, because independence from local 
oscillator power fluctuations is highly desirable. For example, it reduces the effect 
of oscillator amplitude noise. There are applications, however, where operation in 
the multiplier regime is wanted. An excellent example are correlation receivers for 
impulse-radio ultra-wideband systems, where the formation of a cross-correlation 
between the received pulse and a template pulse in the receiver calls for a true 
multiplier. 

The resistive mixer has the advantage of high simplicity, and, most prominently, very 
high linearity with respect to the RF port. The latter is especially true when local oscil- 
lator power leaking through the gate-drain capacitance is short-circuited to ground, so 
that the drain potential is not modulated at the LO frequency [27]. 

Disadvantages are the significant required local oscillator voltage swing and the 
conversion loss inherent to the voltage divider principle. 

An important issue for practical mixers is port isolation — ideally, power fed into 
the LO port should not be present at the RF and IF ports, and power fed into the 
RF port should not be present at the IF port in downconverters, while power fed into 
the IF port should not leak to the RF port in upconverters. In the resistive mixer, 
the isolation between the RF and IF ports is realised by filters only, which is a 
significant disadvantage. The LO-to-RF and LO-to-IF isolations are somewhat bet- 
ter because the leakage path is via the gate-drain capacitance, but it may still be 
too high. 

Mixer concepts where port isolation is assisted by destructive interference are much 
better, in this respect. They will be treated next. 
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Single-balanced mixer 


Consider the circuit in Figure 5.104. Transistor Q; is a common-source amplifier stage 
fed by the RF signal. Its load is formed by the differential pair Q2, Q3 with load resis- 
tances RL. Ro and R3 are for biasing only. Q2 and Q3 are driven by the local oscillator — 
but due to the balun transformer,’ their gate signals are exactly 180° out of phase. 
Assume that the local oscillator signal is large enough so that the Q2 and Q3 are 
being switched off alternately. Then, the differential voltage Vip can be written as 


Vir = RuIp [2 - rect(wiot) — 1), (5.255) 


where Jp, is the drain current of Q,. Provided that Q, is operated in the small-signal 
regime, the small-signal IF output voltage is, considering the fundamental frequency 
component of the local oscillator signal only, 


~ 4 R 
Vir = ye SiN WLC SiN wRFt, (5.256) 
me 


using Equation (5.254) and Ver = Vie sin(wpof). 


The desired multiplication is again visible, producing spectral components at mrp + 


WLO.- 
The mixer uses a special property already discussed in the context of differen- 
tial amplifiers: the common-source connection of transistors Q; and Qo is a virtual 


Vop ° 


=e Q; th 


© eH 


aos ah 
RF O | | P| 0, 
Rg 
Ves1 


Single-balanced mixer circuit. 


7 Note that baluns are drawn as transformers in this and the following circuit diagrams. At micro- and 
millimetre-wave frequencies, they are rarely transformers, but may be realised using transmission line 
segments, or as active baluns. 
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Single-balanced upconversion mixer with FETs in shunt configuration. 


ground for purely differential excitation. Therefore, the local oscillator and RF ports 
are highly decoupled by virtue of destructive interference. LO-to-RF leakage is very 
critical, because the local oscillator signal must not radiate via the antenna. 

The circuit shown in Figure 5.105 is also a single-balanced mixer, used to upconvert 
a signal at an intermediate frequency to a higher frequency (RF). As opposed to the cir- 
cuit shown in Figure 5.104, where the switches were in series configuration, they are in 
shunt here, alternately connecting the two ends of the top balun to ground, periodically 
changing the sign of the IF signal at the RF port and thus producing the desired mul- 
tiplication of the LO signal with IF. As upconversion is what we desire, a subsequent 
filter will have to suppress the difference frequency and pass only the sum. Because the 
RF port is connected to the virtual ground connection with respect to fully differential 
excitation of Q; and Q2, destructive interference of potential LO leakage at the IF port 
will again ensure a very high LO-to-RF isolation. 


Double-balanced mixer 


The configuration shown in Figure 5.104 is a building block of the double-balanced 
mixer, probably the most popular active mixer configuration in MMIC design today. 
These topologies are commonly referred to as Gilbert cells [12]. 

In the double-balanced mixer (Figure 5.106), all signals have to be applied in a dif- 
ferential fashion — this is indicated by the presence of three baluns. Transistors Q, 
and Q» form a differential amplifier, connected to another pair of differential ampli- 
fiers Q3— Qs. In the original publication by Gilbert, the configuration is operated as 
a true four-quadrant multiplier, while in most applications, the top four transistors are 
switched by the LO signal between two states — they are jointly referred to as the switch- 
ing quad. Only the latter mode shall be discussed here. In this case, the double-balanced 
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Block diagram of a double-balanced mixer of the Gilbert cell type, partitioned into a differential 
amplifier and a phase-reversing switch. 


mixer can be viewed as the cascade of a differential amplifier and a phase-reversing 
switch, such that the amplified RF signal changes its phase by 180° with the period of 
the LO signal. This is shown schematically in Figure 5.107. 

The big advantage of the double-balanced mixer is that all ports are now decoupled 
by destructive interference. For example, the differential RF signal leakage cancels out 
at both joint gate connections of the switching quad. Likewise, the LO leakage cancels 
at the source connections of Q3, Qa and Qs, Q¢6, respectively. 

A disadvantage of the circuit shown in Figure 5.106 is the rather large voltage head- 
room required. On top of the required minimum drain-source voltage of the switching 
quad and the differential amplifier Q;, Q2, additional headroom is required for the cur- 
rent source Jo. The latter is frequently replaced by a parallel resonant circuit, tuned to 
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Double-balanced mixer with a ring topology. 


ore. It provides the required high impedance for the RF signal, but provides a zero DC 
resistance, lowering the supply voltage requirements. 

Double-balanced mixers can also be realised using resistive mixer principles (see 
Section 5.6.2). This has the advantage of very low power consumption, but of course 
the circuit will not be able to produce any conversion gain. 

Figure 5.108 shows an example. Depending on the choice of the drain potential Vpp, 
this circuit can be operated as a resistive mixer (Vps low, in the linear regime), or the 
transistors may act as current switches (Vps in the saturated region). The RF signal is 
applied to the drains of the transistors. The LO signal alternately turns on transistors 
Q1, Q2 and Q3, Qa, leading to a periodic phase reversal of RF signal at the IF port. 


Micromixer 


The micromixer concept, shown in Figure 5.109, evolved from the Gilbert multiplier 
topology and was also published by B. Gilbert [13]. The transistors Q4—Q7 are the 
switching quad, as before — the circuit is drawn using bipolar transistors here, but would 
work as well with FETs. The topology is somewhat simplified, for example the baluns 
are not included. 

The local oscillator signal is again applied in differential format. The RF signal, 
however, is single-ended and applied to the amplifier structure formed by the transis- 
tors Q1—Q3. Q, is a common-base amplifier stage and Q3 a common-emitter stage. 
Because the former provides a non-inverting and the latter an inverting voltage gain, 
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Fig. 5.109 Simplified micromixer topology using bipolar transistors. 


the signals fed into the two branches of the switching quad are 180° out of phase — the 
circuit doubles as an single-ended-to-differential converter. Q2 forms a current mirror 
with Q3 such that the currents in both branches are equal. 

Frequently, the gain of the common-base/common-source amplifier/balun is 
increased by injecting additional current into the two branches. This is indicated in 
Figure 5.109 by the two dashed current sources Io. 

Compared to the original Gilbert multiplier, the input stage of the micromixer can be 
made more linear, and can be designed to achieve a broadband match, using the resistors 
R, and Ro. 
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Diode-based mixers 


While the circuit design descriptions in this chapter (and, for that matter, the whole 
book) emphasise concepts with active components, the use of diodes in mixers must 
be mentioned here because they are still very commonly employed, especially at 
millimetre-wave frequencies where transistors lose their usefulness. The diode of choice 
is the Schottky (metal—semiconductor) diode due to the absence of carrier storage effects 
when switching from forward to reverse bias. 

As in the case of operating a FET as a resistive or switching mixer, the principle 
is the change of the differential resistance. For a diode, this is shown in Figure 5.110. 
Clearly, the differential diode conductance gp = dIp/dVp is very low for bias point 
A (or equally for Vp < 0) and very high for bias point B. We will now use the local 
oscillator again to periodically change the diode between these two states. 

Consider Figure 5.111, which depicts the very popular diode ring mixer. The circuit 
is very similar to the FET ring mixer shown earlier (Figure 5.108). Assume that the 
RF signal is always much smaller than the local oscillator (LO) signal. Then, the diode 
state will depend on the applied LO signal only. Either the left or the right diode pair 
may conduct and exhibit a high differential conductance. The diode ring acts as a phase- 
reversing switch, which connects the RF signal to the intermediate frequency (IF) load 
with periodically alternating polarity. 

Similarly, single-balanced and single-ended (unbalanced) mixer topologies can be 
realised using diodes. This shall not be further expanded here. 


Ip/A 


0.04 } 4 
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Example Schottky diode I-V characteristics, with bias point indicated for small (A) and high 
(B) differential conductance. 
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Diode ring mixer. 
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The image frequency problem in downconversion mixing: ORF and @jmg both produce 
components at wpe when mixed with a 9. 


Image-rejection mixer topologies 


One problem inherent to any of the mixers discussed here still needs to be mentioned. 
When operated as a downconverter (converting an RF signal at a higher frequency to a 
lower IF), it uses the fact that the multiplication of sinusoidal signals produces a spectral 
component at the difference of the two initial frequencies, as was discussed in Equation 
(5.248). 


2 sin(@;t) sin(w2t) = cos [(@1 — w2)t] — cos [(@, + @2)ft]. 


Consider now a case where @; = opr is the RF signal and w2 = @yo is the LO 
frequency, which is lower than wrr. The difference is the IF: opr — OLo = or. If, 
however, there is another signal present at the RF input, with a frequency @img = ORF — 
2ejp, it will also mix with @Lo to produce a spectral component at the right IF: oLo — 
®img = tp. The problem is schematically shown in Figure 5.112. wimg is called the 
image frequency. 
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The signal at the image frequency will now overlay the downconverted RF sig- 
nal and in most cases cause unacceptable interference. The most common way to 
avoid this is to make sure, by appropriate filtering, that no signal is in fact present at 
Oimg- 

In upconversion mixers, the image problem also applies. Upconverting a signal at 
ar by means of an LO at a higher frequency wer results in two spectral components at 
@LO = wr. Again, one of these components will have to be suppressed, classically by 
filtering. 

However, in modern receiver and transmitter concepts, there is a trend to lower IFs 
where digital/analogue conversion can be achieved very inexpensively. Consequently, 
the frequency distance 2m 1p between the desired signal and the image is getting smaller, 
requiring very rigid filtering. The necessary filter qualities are rarely possible on chip, 
which is another significant drawback, as off-chip filters are costly in production and 
assembly. 

A more convenient solution is the use of an image-rejection mixer. The image- 
rejection mixer topology can use any of the fundamental mixer circuits we discussed. 

The block diagram shown in Figure 5.113 shows a possible implementation for a 
downconversion mixer. At the input, two signals shall be present, at the RF and image 
frequencies: 


ORF = OLO + IF, img = OLO — @IF 


The signal is split into two signals with equal amplitude, but 90° phase shift (in quadra- 
ture). We assume that this can be done equally for the RF and the image signals — this 
is a good assumption because the concept is especially important if RF and image sig- 
nals are located in close spectral proximity and cannot be separated easily by filtering. 
For simplicity, we assume also that both RF and image signals are sinusoidal with an 
amplitude of 1. 

The oscillator signal, at w_o, shall also have an amplitude of 1. It is split into two 
signals with equal amplitude and phase. 


| > 


0° 
-90° 


0°/-90° 


WREr Yimg — 


Block diagram of an image reject mixer. 
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Let us first consider the RF signal only (the image signal is turned off). The signal at 
the output of the top mixer is 


Lie ; 
gi(t) = 4 sin(wppt) sin(w_ot) 


1 
= {cos[(@RF — @LO)t] — cos[(@RF + @Lo)t]} . 


The high frequency component at wr + wo is easily suppressed in the low-pass filter 
(LP). After the low-pass filter in the upper branch, 


Zit)= ; case = capi] = : cos(wyRt) = : sin (crt el =) _ (5.257) 
In the lower branch, the RF signal is delayed by —zr/2. 
Using sin(wpr — 7/2) = —cos(Mprpt), we obtain at the output of the bottom 
mixer: 
1 . 
go(t) = “4 cos(@Rpt) sin(@Lor) 


1 
= oa [sin(@gp + @LO)t — Sin(@RF — @Lo)t). 


The high frequency component is removed in the low-pass filter: 
1 
g(t) = 8 sin(@ypf). (5.258) 


The four-port at the right of the block diagram in Figure 5.113 is a 90° hybrid cou- 
pler. Signals fed into the inputs emerge at outputs A and B with equal amplitude, but 
the signal entering at the top experiences an additional —z/2 phase shift at output B, 
while the signal entering at the bottom experiences an additional phase shift of —z/2 at 
output B. 
Using Equations (5.257) and (5.258), we then find at A: 

mie 1 4 

3 [sin (crt + =) + sin (cont — =) = 0. 
At B: 


1 1 
8 [sin(@ypt) + sin(@ypt)] = 4 sin(@ypffr). 


The IF signal due to the wanted (RF) signal hence only appears at output B. 
Let us consider the image frequency, which is below w,o such that jp = @LO— img. 


Using sin(—w) = —sina@ and cos(—@) = cosa, we find at the output of the upper 
low-pass filter: 
; 1, a 
gi () = — gin (cont = =) (5.259) 


while at the output of the lower low-pass filter: 


ey = -; sin(@yFr). (5.260) 
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Image reject mixer topology with quadrature LO. 


WREr Wimg 


A ‘third method’ image-rejection mixer topology using two mixing steps. 


These signals combine at output A as 


j [sin (or ~ 5) + sin (ome — 5)] = —G sin (ot — 3) 
8 S1N | @]F 5) S1N | @]F 5) => Plea OIF 5) : 
At B: 


1 
-3 [sin(@ypt — 2) + sin(@ypt)] = 0. 


The IF signal due to the image frequency only appears at port A. The circuit in Fig- 
ure 5.113 hence separates the IF components due to the RF and image signals at the 
input. 

The topology in Figure 5.114 serves the same purpose and was described by Hartley 
already in 1928 [18]. It has the advantage that oscillators can be constructed so that they 
directly generate quadrature output signals, which maintain 90° phase shift over a wide 
frequency band. 

The final topology, introduced by Weaver in 1956, eliminates the 90° hybrid at the 
IF, but uses a second down-conversion step [39]. It is shown in Figure 5.115. 

While it requires two additional mixers and low-pass filters plus an extra LO, it 
eliminates the need for the IF hybrid which, for low IFs, is very difficult to integrate. 
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Mixer noise figure 


The existence of an image frequency also complicates the noise figure assessment of 
mixers; while the intended signal is present at only one frequency (pF), noise present 
at the image frequency wimg is also converted to the IF and decreases the signal-to-noise 
ratio at the mixer output. 

In principle, noise figure definitions for mixers use Friis’ definition of the two-port 
noise figure (Equation (5.71)); the noise figure is calculated as the quotient of the signal- 
to-noise ratios at the input and the output of the two-port, assuming that the noise 
temperature of the input is 7p) = 290K. 

The issue here is how to calculate the signal-to-noise ratio at the input. 

We may consider the signal-to-noise ratio only for the RF frequency. If the signal 
power is S, the signal-to-noise ratio before the mixer is 


S 


SNRinp = ar 


(5.261) 
where Af is the measurement bandwidth. 

The mixer noise sources are combined in an equivalent noise source kT, Af, placed 
at the input. 7, is the noise temperature of the mixer. 

Generally, the mixer’s gain at the RF and the image frequency can have different 
values, which we call Grp and Gimg, respectively. ® 

The image frequency also contributes a noise power of kT) Af. The signal-to-noise 
ratio at the output is then 

Gres 


SNRout = : (5.262) 
kKAf[Gre(To + Tn) + GimgTo] 


The ratio of the expressions (5.261) and (5.262) is called the single-sideband noise 
Jigure: 


(5.263) 


We may also simply apply Equation (5.73), developed originally for two-ports, to the 
mixer. The result is the IEEE single-sideband noise figure: 

Tn 

Fssp IEEE = | + — (5.264) 

Tp 
The IEEE definition is similar to (5.263), provided that Gimg < Grr. However, we can 
also calculate the signal-to-noise ratio before the mixer taking the thermal noise at the 
image frequency into account. Then, 


S 


SNRinp — QkTAF . 


(5.265) 


8 Note that all gains are available gains, i.e. power match is assumed for all ports. 
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Now using (5.265) and (5.262) to calculate the noise figure, we obtain the double- 
sideband noise figure: 


Gime 
Fssp_—-F'SSB,IEEE + G 
= = i — (5.266) 


On a logarithmic scale, Fssp is thus 3 dB larger than Fpsp. 

For the assessment of receiver systems, Fssp is the appropriate entity because we 
need to compare signal-to-noise ratios at the intended frequency only. However, mea- 
surements of mixer noise figure invariably yield Fpsp, unless the image frequency is 
suppressed at the input of the mixer by appropriate filtering. When using mixer noise 
figure data, this has to be carefully observed. 


Baluns, unbals and hybrids 


In the preceding sections, reference was frequently made to mysterious building blocks 
which convert signals from single-end to differential (two signals of equal amplitude, 
but 180° out of phase), or splitting a signal into two parts ‘in quadrature’ (90° out of 
phase). Due to their importance, they also deserve a brief section of their own. 

The term balun is a contraction of balanced to unbalanced, describing the function 
this component performs; they convert a balanced (differential) signal, which can be 
understood as two ground-referenced signals of equal amplitude, but 180° phase differ- 
ence, to an unbalanced signal, which is a single ground-referenced form. The unbal does 
exactly the opposite — it converts an unbalanced signal to a balanced (differential) one. 


Passive baluns and unbals 


Passive baluns and unbals are essentially the same components, operated in different 
directions. Therefore, it is common to call them by the name ‘balun’ only, irrespective 
of the actual role. 

A very common balun/unbal at lower RF frequencies is the centre-tapped transformer 
(Figure 5.116). Due to the symmetric centre tap of the secondary winding, the balanced 
output voltage is formed by two voltages vj and vy of equal amplitude, but opposite 
phase. At lower frequencies, the centre-tapped transformer balun is frequently con- 
structed as a toroidal transformer, using ring-shaped magnetic cores. As the frequency 
of operation increases, this will rapidly not be possible anymore as the useful frequency 


Center-tapped transformer as a balun/unbal. 
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Lumped-element balun circuit. 


range of the core material is exceeded and also the parasitic capacitances of the windings 
become excessive. 

A rather straightforward balun implementation is shown in Figure 5.117. It uses a 
combination of LC low-pass and high-pass filters to provide phase shifts of —90° and 
+90°, respectively, at the design frequency wo. It can provide impedance transformation 
at the same time. Both high-pass and low-pass filters provide the 90° phase shifts if 

L1C, = . (5.267) 
9 
At this frequency, they need to additionally fulfil: 


— =22Z,Z 5.268 

C; 1Z2 ( ) 
to transform between the single-ended port impedance Z, and the differential port 
impedance 2Z>. Solving Equations (5.267) and (5.268) yields 


/ ZZ 


i= (5.269) 
wo 
Li 
Cy = : 5.270 
|= 377 ( ) 


As all lumped-element transformation circuits, the lumped-element balun will be quite 
narrow band. 

At microwave frequencies, a rather large number of possible implementations of 
transmission line baluns exist. We will restrict our discussion to one example. 

A very popular transmission line balun, whose operation is also quite easy to 
understand, is the rat-race coupler structure shown in Figure 5.118. It consists of a trans- 
mission line ring with a circumference of 1.5 times the wavelength i, with four ports 
arranged as shown in Figure 5.118. If equal power distribution to the coupled ports 
is desired, the transmission line ring must have a characteristic impedance of /2Zo, 
where Zo is the port impedance. If we feed a signal into port 1, it will split into two par- 
tial signals, which travel clockwise and counter-clockwise through the ring. They will 
interfere constructively at port 3, 2 and 3, while the phase difference is z at port 4, lead- 
ing to destructive interference there. The signal at port 3 lags port 1 by 2/2, while the 
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A 180° hybrid (‘rat race’) coupler structure. 


signal at port 2 leads port | by 2/2 (well, it lags by 32r/2, but that is the same). Because 
port 4 is decoupled, and we have equal power distribution, the scattering parameters 
with respect to port | are then: 


il 
So, = 5 

1 
S31 =-J— 


V2 
S41 = 0. 


The phase difference between ports 2 and 3 is z or 180° — a single-ended signal into 
port 1 is converted into a differential signal between ports 2 and 3. The component is 
reciprocal, of course — a differential signal applied between ports 2 and 3 will combine 
into a single-ended signal out of port 1. 

The rat-race coupler has another interesting property — a signal inserted into port 4 
will split into in-phase components out of ports 2 and 3, while port 1 is isolated. In 
scattering matrix terms, 


So = —J 


N 


S34 = —J 


SI 


Si4 = 0. 


Active baluns and unbals 


A major disadvantage of the passive baluns discussed so far is their narrow bandwidth of 
operation, and at lower frequencies their potentially large chip area consumption, due to 
either the size of the transmission line segments or the size of the necessary reactances. 

Active circuits can provide balanced-to-unbalanced and unbalanced-to-balanced- 
conversions over a wide operational bandwidth, and often in a very small chip area. 
They are, therefore, frequently used in IC implementations of microwave circuits. 
Disadvantages are the additional noise due to the active components, potential non- 
linearities and the added power consumption, which can be considerable if high linearity 
is required. 
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Active balun circuit. 


Figure 5.119 shows a typical active balun conversion circuit. Transistors Q1—Q3 
form a differential amplifier, which is used here to suppress any common-mode com- 
ponents between the input voltages Vin; and Vin2. Transistors Q4 and Qs5 form the 
balun proper — Q4 acts in common-drain (source follower) configuration, while Q5 is in 
common-source configuration, both working against the common single-ended output. 

The simplest active unbal (Figure 5.120), uses a transistor which is simultaneously 
configured in common-source and common-drain topology. Output | is at the source, 
hence the voltage gain is in good approximation +1. Output 2 is at the drain, and the 
resistors R; and R2 must be chosen such that the voltage gain is —1, leading to the 
desired balanced output signal Vout. Quasistatically, this is very simple — the small- 
signal output voltage vout,2 18 


Vout, 1 
Ro 


Vout,2 = — Ry 


J 


so that for equal magnitudes of Vout,1 and Vout,2, 
R, = R>. 


For higher frequencies, the performance deteriorates, especially due to leakage through 
the transistor’s gate-drain capacitance. The useful range can be extended by cascading 
a differential amplifier with good common-mode rejection. 

Figure 5.121 shows an alternative active unbal, which uses common-source and 
common-gate amplifiers in parallel. Q; is a common-gate topology, with a quasi-static 
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Fig. 5.120 Active unbal using a transistor in common-source/common-drain configuration. 


Vop 


O Vout,1 


i Q, 
= O Vo ut,2 


Vin Oo | | t > Q, 
Fig. 5.121 Active unbal using a combination of common-source and common-gate topologies. 


small-signal gain of gmR1,’ while Q» is acommon-source amplifier with a voltage gain 
of —gmR2. Provided Rj = R2 and equal transistor transconductances gm, both output 
voltages will have the same magnitude, but opposite phase: Vout,1 = Vout,2- 

Compared to the earlier circuit (Figure 5.120), this circuit has the advantage that 
the input impedance is significantly reduced, leading to a higher bandwidth, while the 
output impedances are equal for both ports. 


° Neglecting the transistor output conductance. 
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Quadrature generation 


The generation of two signals with 90° phase shift is another very frequent task, as we 
have seen in the discussion of image-rejection mixer topologies. 

The simplest generation of quadrature signals is realised using the simple RC network 
shown in Figure 5.122. The output voltages are 


1 
Vo = Vy —___ 5.271 
T+ JoRIC) ee 
JORiCi 
| ee (pe i et 5.072 
PN aR Cy O22) 


At wo = 1/(R1C}), Vag lags Vo by 45°, while Vj leads Vo by 45° — Vg and Vy are hence 
in quadrature. The 90° phase difference between Vg and Vj is maintained over a wide 
frequency range, but the amplitudes are equal only at wo. Note also that this is a lossy 
network — at wo, | V1] = |Vol = Vo/V2. 

The polyphase filter family, an example of which is shown in Figure 5.123, uses 
also passive RC elements to generate quadrature output signals from a differential input 


RC polyphase network for differential quadrature generation. 
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signal. It is very frequently used in fully differential receiver concepts. All RC filters 
have significant loss and should therefore not be used in noise-critical signal paths — 
a common use is for quadrature generation from differential LOs, or in quadrature IF 
combiners at the output of image-rejection mixers such as the one shown in Figure 
5.114. An in-depth description of polyphase filter operation can be found in [3]. 

Another common quadrature generator, this time using transmission lines, is the 90° 
hybrid, shown in Figure 5.124. 

All transmission line segments are electrically a quarter wavelength long. A signal 
fed into port 1 will emerge at 2 with 90° phase lag, and with 180° at port 3. The signals 
at 2 and 3 are hence in quadrature. The signal from port 1 will interfere destructively at 
port 4, which is hence ideally decoupled, provided that 2 and 3 are terminated with the 
proper impedance Zo. 

Quadrature signals can also be generated digitally where linear operation is not 
mandated — following an oscillator. 

The circuit depicted in Figure 5.125(a) needs a clock frequency at four times the 
intended output frequency. As state changes occur only on the rising edge of the clock, it 


4 


Transmission line 90° hybrid. 
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Quadrature signal generation using flipflops. 
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is insensitive to the clock’s duty cycle. However, the very high clock frequency require- 
ment makes it unsuitable for many applications. The circuit in Figure 5.125(b) needs 
only twice the output frequency as a clock, but as it triggers on both the rising and the 
falling edges of the clock, it is very sensitive to the clock’s duty cycle. The time delay 
in the inverter may lead to additional problems as the clock frequency increases. 

Finally, quadrature clock signals may also be generated in special oscillator circuits. 
This, however, shall be beyond the scope of this book. 


5.8 Problems 


(1) 


(2) 


(3) 


(4) 


Voo 
Vax R,=R,=50Q 
S C— 00 
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Demonstrate that the power delivered to a load of arbitrary impedance can be 
expressed as the difference in the squared magnitudes of the incident and reflected 
normalised power waves — see Equation (5.3). 


Consider a two-port whose scattering matrix is known. Calculate the power deliv- 
ered to an arbitrary load Z,, as a function of the available power of the generator, 
whose source impedance shall be equal to the normalising impedance Zo. 


You have to design a common-source amplifier with a FET technology whose 
transit frequency fr is 50GHz. The load resistance of the amplifier is given 
to be 100 &, the voltage gain shall be Ay = —10. Calculate the input capaci- 
tance — hint: use the common rule of thumb that Cgp * 0.1 - Ces. You may also 
neglect gps. 


Consider the circuit in Figure 5.126. The transistors Q; and Q> shall have a 
transconductance of g,, = 20 mS and atransit frequency fr = 50 GHz. Calculate 
the input admittance of this circuit. 


C00 


| >| 0, 


tie 
ce 


R= 1002 


Fig. 5.126 Active unbal circuit for Problem 4. 
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(5) 


(6) 


Suggest a simple one-transistor circuit capable of creating an impedance (ref- 
erenced to ground) with a real part Re{Z;} = —400Q at f = 10GHz. 
The FET technology you are using has an fr of 100 GHz and the transistor 
transconductance shall be gm = 50 mS. 


The circuit in Figure 5.127 shall be used to realise a wideband voltage amplifier 
stage. The transistor Q; has an fr = 10GHz and transconductance of gn = 
100 mS in a bias point Jp = 1OmA, Ves = —0.5 V and Vpp is 10 V. The voltage 
gain shall be ay = —10. 

(a) Calculate R3. 

(b) What is the optimum choice for C; with respect to maximum bandwidth, if 
the transistor can be modelled using the simple equivalent circuit in Figure 
5.20? 

(c) Calculate R2. 

(d) What is Vps under these circumstances? 


Voo 


Simple wideband amplifier. 


(7) 


(8) 


(9) 


(10) 


Consider the circuit in Figure 5.32. Given that all transistors have the same size, 
why do Q; and Q> have the same collector current, provided that the current 
gain B > 1? Let now Jc = 5mA for all transistors, 6 = 100, fr = 50 GHz, 
Ry = 100. Draw the small-signal equivalent circuit and calculate the input 
impedance and the voltage gain. 


Discuss three different ways of eliminating the feedback capacitance in ampli- 
fiers. 


The impedance seen in the input of a common-source amplifier stage at f = 
10 GHz is Z; = (5 — 7159) Q. The output admittance of the preceding stage 
is Y7 = 10+ 7,3 mS. Suggest a suitable matching network so that the available 
power is transferred from the first to the second stage. Hint: the Smith chart is 
very helpful here. 


You have the task to design a simple distributed amplifier. The transistors to use 
have a transconductance gm = 400 mS and fr = 70 GHz. The gain cell shall be a 
simple common-source stage. The gate width of each transistor is Wg = 100m. 
For the drain-gate capacitance, use Cpg = 0.1Cgs. 
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The unloaded gate line has a characteristic impedance Z, = 80 and an 


inductance per unit length of L’ = 0.8nHmm!. 


Calculate the necessary length of the gate line segments so that the character- 
istic impedance of the loaded gate line is 5082. The drain line shall equally have 
this impedance. 


(11) In the schematic in Figure 5.63, identify the function of each of the transistors 
shown. What is the purpose of the RC combination attached to the emitters of 
two of the transistors? 


(12) A communications system can tolerate a minimum third-order intermodulation 
distance of 60 dB; the maximum input power is —20 dBm. What is the necessary 
input-referred third-order intercept point? 


(13) Explain why a high resonator quality factor is especially important in oscillator 
using active devices with little low-frequency noise, such as Si/SiGe HBTs. 


(14) Show how the Hartley image reject topology (Figure 5.114) achieves separation 
of the signal and image frequencies to output A and B. 
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absorption/gain spectra, 245 
absorption processes, 21-24 
acceptor, 19 
ACPR, see adjacent channel power ratio (ACPR) 
active baluns and unbals, 414-416 
active forward operation, 116 
adjacent channel power ratio (ACPR), 375-376 
amplifier configurations 
common drain/common collector, 321-323 
common gate/common base, 319-321 
common source/common emitter, 316-318 
Miller effect, 318 
with two transistors, 329-336 
Battjes fr doubler, 333 
cascode amplifier, 333-336 
common-drain/common-source, 329-332 
Darlington amplifier, 332-333 
y matrix representation, 315-318 
amplifier linearity, 370-376 
adjacent channel power ratio, 375-376 
single-tone excitation, 370-372 
two-tone excitation, 372-375 
ANN, see artificial neural networks (ANN) 
anti-guiding parameter, 244 
APD, see avalanche photodiodes (APDs) 
arbitrary terminations, 299-300 
Armstrong oscillator, 388 
artificial neural networks (ANN), 168-169 
associated gain, 312 
atomic bonding, 8 
covalent bonding, 8—9 
atomic physics, 9 
Bohr model, 10-11 
photoelectric effects, 9-10 
avalanche photodiodes (APDs), 275, 276 
input signal photocurrent for, 276 
ionisation coefficient, 272-273 
multiplication factors, 273-274 
optical generation rate, 273 
(SAM) separate absorption and multiplication, 
275: 
avalanching, 120 


bandgap energy, 222, 262 
band line-up, 38-39 
bandwidth 
enhancement, 328 
improvement, 326-329 


base design dilemma, 136-137 
base push out, 125 
base resistance, 128 
base transit time, 126 
base transport factor, 121 
Battjes fr doubler, 333 
bipolar and hetero-bipolar transistors, 115 
heterojunction 
base design dilemma, 136-137 
drift base, 139-140 
HBT implementations, 140-143 
TII-V versus Si/SiGe HBTs, 146-147 
Si/SiGe HBTs, 143-146 
wide-gap emitter, 137-139 
homojunction, 115-116 
collector current equation, 117-118 
diffusion triangle, 116-117 
Early effect, 122-124 
ideal base current, 118-119 
ideal current gain, 119 
Kirk effect, 124-125 
non-ideal current contributions, 119-120 
non-ideal current gain, 120-121 
saturation, 121-122 
large-signal modelling, 147 
BJTs and HBTs, 155-156 
Ebers—Moll model, 147 
Gummel—Poon model, 147-150. 
MEXTRAM model, 152-153 
VBIC95 model, 150 
microwave noise performance of bipolar 
transistors, 131-134 
small-signal dynamic behaviour, 125-129 
maximum frequency of oscillation, 130-131 
transit frequency, 129-130 
transit time optimisation 
collector transit time optimisation, 135-136 
drift field in base, 134-135 
bipolar junction transistors (BJT), 46 
BJT, see bipolar junction transistors (BJT) 
BJTs and HBTs, 155-156 
Bloch theorem, 16 
body-centred cubic lattice (BCC), 6 
Bohr formulation, 11 
Bohr model, 10-11 
Boltzmann’s constant, 222 
Bragg frequency, 356 
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broadband amplifier techniques, 350-354 diffusion current, 29 
feedback techniques, 353-354 drift current, 29 
shunt peaking, 351-353 cascode 
building blocks for high-speed analogue circuits, amplifier, 333-336 
291 topology, 368 
baluns, unbals and hybrids cavity photons, 252-253 
active baluns and unbals, 414-416 channel, 46 
passive baluns and unbals, 412-414 channel length modulation, 103 
quadrature generation, 417-419 channel noise, 63 
mixers, 396 COBRA current equation, 89 
diode-based, 406-407 collector 
double-balanced, 402—404. current equation, 117-118 
image-rejection mixer topologies, 407-410 resistance, 128 
micromixer, 404-405 transit time optimisation, 134-136 
noise figure, 411-412 common-collector configuration, 321 
resistive, 398-400 common-drain configuration, 321 
single-balanced, 401-402 common-source configuration, 329-332 
transconductance multiplier, 397 common-emitter configuration, 316 
oscillators, 383 common source configuration, 316 
noise, 393-396 confinement configuration, 230, 232, 236, 238, 239, 
non-linearity in oscillators, 387-388 248, 257 
oscillator topologies, 388-392 conformal mapping, 296 
resonators, 383-385 constant-velocity approximation, 51—55 
self-excitation criteria, 385-387 constant-velocity model, 52-53 
transistor amplifiers, 313-314 cost function, 163-164 
broadband amplifier techniques, 350-354 Coulomb scattering, 67-69 
configurations, 314-323 covalent bonding, 3, 8-9 
configurations with two transistors, 329-336 critical thickness, 39 
differential, 336-341 crystal directions, 7 
distributed amplification, 354-365 and planes, 6-8 
feedback, 323-329 crystal structure, 5—6 
linearity, 370-376 current continuity, 50 
low-noise, 365-370 current-voltage characteristics, 33-34 
power, 376-383 
source-coupled, 341-342 dark current flowing in slab, 263 
tuned, 342-350 Darlington amplifier, 332-333 
two-port networks, basic relations for de Broglie relation, 11-12 
impedance matching, 297-298 density of states, 3, 19, 20, 35, 36 
Mason’s unilateral gain, 305-306 depletion, 30 
maximum available gain and maximum stable approximation, 30 
gain, 305 capacitance, 33 
maximum frequency of oscillation, 306 layer width, 32-33 
power gains for amplifier design, 298-302 device model optimisation, 163-164 
scattering parameter theory, 291-295 diamond, 8 
Smith chart, 295-297 differential amplifiers, 336-341 
stability, 302-305 common mode, 338-339 
built-in potential, 32 differential mode, 336-338 
bulk semiconductor, 257 neutralisation of, 339 
Burrus diode, 229 diffusion components, 254 
diffusion current, 29 
CAD modelling of HEMTs, 88 diffusion triangle, 116-117 
MESFET versus HEMT, 91-92 diode-based mixers, 406-407 
non-linear capacitance equations, 90-91 diode responsivity, 270 
static current equations, 88-90 direct and indirect semiconductors, 20 
carrier confinement, 36 absorption processes, 21-24 
carrier lifetime, 226, 240-241, 248 exciton absorption, 24-25 
carrier transport in semiconductors, 29 direct bandgap semiconductors, 20 
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distributed amplification, 354-365 
amplifiers with cascode cell, 360-361 
amplifier variations, 359-360 
gain and loss in amplifiers, 358-359 
general design procedure, 356-358 
using Si/SiGe HBTs, 363-365 

distributed Bragg reflector (DBR) lasers, 

253 

distributed feedback (DFB) laser, 253 

donor, 19 

double-balanced mixer, 402-404. 

drain current, 50-51, 100-101 
backgating, 102 
constant-mobility model, 101-102 
non-ideal effects in short-channel MOSFETs, 

102-105 
velocity saturation, 105-106 

drift base, 139-140 

drift current, 29 
density, 268 

drift field in base, 134-135 

drift saturation velocity, 52 


Ebers—Moll model, 147 
edge-emitting LED, 228 
electrical pumping, 222 
electrons 
and hole distribution, 19-20 
in semiconductor, 16 
electrostatic theory, 67 
emission coefficient, 120 
emitter efficiency, 121 
emitter follower, see source follower 
epoch, 173 
exciton absorption, 24-25 
external power efficiency, 226 
extrinsic semiconductors, 18-19 


Fabry-Perot lasers, 248 

face-centred cubic lattice (FCC), 6 

facet mirror reflectivity, 230 

Fermi—Dirac distribution, 16 

fitness index, 180-181 

flat-band, 99 

FM signal, 372 

forward-biased emitter—base 
junction, 125 

free electron, 16 

fr doubler, 332 


GaAs MESFET, 48 

gain coefficient, 239-240 
gain-guided lasers, 231-233 

gain saturation sets, 371 

gate length, 49 

gate width, 49 
generation-recombination noise, 265 


gradual channel approximation, 49 
Gummel—Poon model, 147-150 


Hawkins’ theory, 133 


HBT, see heterojunction bipolar transistors (HBT) 


Heisenberg uncertainty principle, 12 
heterojunction bipolar transistors (HBT) 
base design dilemma, 46, 136-137 

drift base, 139-140 

III-V versus Si/SiGe HBTs, 146-147 

implementations, 140-143 

Si/SiGe HBTs, 143-146 

wide-gap emitter, 137-139 
heterojunction LED, 228 
heterostructures, 35-37 

band diagrams, 37-38 

band line-up, 38-39 

constructing heterostructure band diagrams, 

37-38 
lattice mismatch, 39-40 


heterostructure bipolar transistor photodetector, 284 


HICUM model, 153-155 
high electron mobility, 80 
high electron mobility transistor (HEMT), 67 
CAD modelling, 88 
MESFET versus HEMT, 91—92 
non-linear capacitance equations, 90-91 
static current equations, 88-90 
charge control, 69-74, 69-75 
channel current — constant mobility, 75-77 
channel current — constant velocity, 77-78 
Coulomb scattering, 67-69 
high electron mobility, 80 
non-ideal behaviour, 80-83 
trapping effects, 83-85 
small-signal parameters, 78-79 
structural variations, 85 
metamorphic, 87-88 
pseudomorphic structure, 86-87 
pulse-doped structure, 85-86 
high-speed lasers, 259-261 
separate confinement (SC) region, 261 
tunnelling injection laser, 261 
hill-climbing capability of SA, 166 
homojunction bipolar transistors, 115-116 
collector current equation, 117-118 
diffusion triangle, 116-117 
Early effect, 122-124 
ideal base current, 118-119 
ideal current gain, 119 
Kirk effect, 124-125 
non-ideal current contributions, 119-120 
non-ideal current gain, 120-121 
saturation, 121-122 
Hopfield recurrent neural networks 
energy function, 174 
HBTs modelling of, 174-177 
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image frequency, 407 

image-rejection mixer topologies, 407-410 
impedance inverters, 349 

impedance matching, 297-298 
index-guided lasers, 231-233 

indirect gap semiconductors, 223 

inductive source degeneration, 326 

input signal photocurrent, 270 

internal quantum efficiency, 225-226 
intrinsic and extrinsic base resistances, 193-195 
intrinsic semiconductors, 18 

ionic bonding, 8 

ion-implanted MESFET, 53 


JFET, see junction field effect transistor (JFET) 
Johnson noise, 265, 306 
junction field effect transistor (JFET), 47 


Kirchhoff’s law, 50 

Kirk effect, 124-125 

knee voltage, 51 

Kronig—Penney model, 16-17 
carriers in semiconductors, 18 
effective mass, 17-18 


Langevin components, 254 

large-signal CAD model, 55-56 
capacitance model, 56-57 
parasitic circuit elements, 57-58 

laser noise, 253-255 

lattice, 5 

lattice constant, 5 

lattice mismatch, 39-40 

LED, see light-emitting diodes (LED) 

L-I curve, 242-244, 249 

light-emitting diodes (LED) 
carrier dynamics for, 226 
mechanisms for, 222 
modulation response, 226-227 
n*—p junction, 223-224, 226-227 
optical power and, 225 
recombination rate, 222—223 
reflection coefficient, 225 
responsivity of, 226 
structure of, 227-229 
usage of, 229 

light inversion, 98 

Lilienfeld’s FET concept, 47 

linear regime, 51 

linewidth broadening factor, 252-253 

linewidth enhancement factor, see anti-guiding 

parameter 

local minimum trapping, 164 

lossless networks, 294 

lossy networks, 294 

low-frequency noise, 64 

low-noise amplifier, 365-370 


mapping, 296 
Mason’s unilateral gain, 305-306 
massively distributed computing networks, 
170-171 
maximum available gain, 344 
and maximum stable gain, 305 
maximum frequency of oscillation, 112, 130-131, 
306 
Meissner oscillator, see Armstrong oscillator 
MESFET, 46-50 
constant-velocity approximation, 51—55 
drain current, 50-51 
large-signal CAD model, 55-56 
capacitance model, 56-57 
parasitic circuit elements, 57-58 
noise performance, 62 
low-frequency noise, 64 
microwave noise, 62-64 
small-signal equivalent circuit, 58-61 
versus large-signal model, 58 
maximum frequency of oscillation, 61 
transit frequency, 60-61 
in third millennium, 64-66 
Metal-oxide-semiconductor field effect transistors 
(MOSFET), see MOSFET 
metal-semiconductor field effect transistor 
(MESFET), see MESFET 
metal—semiconductor—metal detectors, 277 
capacitance of, 278 
quantum efficiency, 279 
metamorphic HEMT, 87 
metastable region, 42 
MEXTRAM model, 152-153 
micromixer, 404—405 
microwave noise, 62-64, 112-114 
performance of bipolar transistors, 
131-134 
Miller capacitance, 58 
Miller effect, 318 
Miller indices, 7 
mirror reflectivity, 230 
mixer noise figure, 411-412 
mixers, 396 
diode-based, 406-407 
double-balanced, 402-404. 
image-rejection mixer topologies, 407-410 
micromixer, 404-405 
noise figure, 411-412 
resistive, 398-400 
single-balanced, 401-402 
transconductance multiplier, 397 
mode suppression ratio (MSR), 242 
modulated cavity photons, 252 
modulated optical power, 227 
modulation response, 226-227 
MOS diode operation, 97-100 
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MOSFET, 46 
MOS (metal oxide-semiconductor) diode, 96 
multiacronym device (MAD), 80 


multi-layer perceptron (MLP) neural networks, 171 


backpropagation, 171-173 
circuit design process, flow chart for, 173 
epoch, 173 


neural networks and modelling, 168-179 
ANN, 168-169 
classes of, 169 
Hopfield recurrent neural networks, 174-179 
massively distributed computing networks, 
170-171 
multi-layer perceptron neural networks, 
171-174 
neuron, 169 
physical implementation of, 171 
structure of, 170 
neural networks by genetic algorithm, optimisation 
of, 180 
chromosomes and genes, 180 
crossover, 180-181 
fitness index, 180 
flow chart for application of, 181 
HBT 
I-V characteristics of, 177-178 
large-signal neural network model of, 177 
mutation, 180 
over-fitting, 181 
neurons, 169-171 
Neuron Transfer Function, 171 
noise, 393-396 
noise in two-ports 
figure, 307-309 
figure with arbitrary generator admittance, 
310-313 
phenomena, 306-307 
noise equivalent power (NEP), 265 
noise figure, 307-313 
with arbitrary generator admittance, 
310-312 
of cascaded two-ports, 309-310 
noise in two-ports 
noise figure, 307-309 
noise figure with arbitrary generator admittance, 
310-312 
noise phenomena, 306-307 
noise performance, 62 
low-frequency noise, 64 
microwave noise, 62-64 
noise phenomena, 306-307 
non-linearity in oscillators, 387-388 
non-radiative recombination coefficient, 240 
normalised detectivity, 265 
nt—p junction, 223-224, 226-227 


427 
optical generation rate, 273 
optical sources, 221 
light-emitting diodes, 222-229 
preliminaries, 222 
semiconductor lasers optical waveguides, 
231-239 
Optimisation and parameter extraction of circuit 
models 


device models, 163—164 
neural networks applied to modelling, 168-170 
Hopfield recurrent neural networks, 174-179 
massively distributed computing networks, 
170-171 
multi-layer perceptron neural networks, 
171-174 
neural networks by genetic algorithm, 180-184 
semi-analytical device parameter extraction 
analysis, 184-198 
parameter extraction results, 198-203 
simulated annealing, 164-168 
small-signal model of the collector-up (inverted) 
HBT, 212-214 
small-signal parameter extraction, 209-211 
Z-parameters at zero bias, 208-209 
Z-parameters for HBT, theoretical 
approximations, 203-208 
structured genetic algorithm, 181-184 
oscillation frequency, 392 
oscillators, 383 
noise, 393-396 
non-linearity in oscillators, 387-388 
oscillator topologies, 388-393 
resonators, 383-385 
self-excitation criteria, 385-388 
oscillator topologies, 388-393 
over-fitting, 181 


parasitic elements extraction of, 
186-189 
passive baluns and unbals, 412-414 
photoconductor detector, 263, 266 
generation-recombination noise, 265 
with interdigitated fingers, 266 
noise equivalent power (NEP), 265 
normalised detectivity, 265 
photoconductor slab, 263 
signal-to-noise ratio, 265 
photodetectors, 261-262 
avalanche photodiodes, 271-277 
heterostructure bipolar transistor photodetector, 
284 
metal—semiconductor—metal detectors, 277-279 
photoconductor detector, 263-265 
P-I-N diodes, 265-271 
responsivity of, 262 
travelling wave p-i—n photodiodes, 279-283 
photoelectric effects, 9-10 
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photon density, 240 
photon flux density, 267 
photon lifetime, 240-241 
photons, 9 
pinch-off voltage, 50-51 
P-I-N diodes, 265-271 
conduction current density, 268-269 
diffusion current density, 268 
drift current density, 267-268 
p-i-n junction photodiode, 267 
equivalent circuit of, 270 
Planck’s constant, 9, 10 
p-n junction(s), 30-31, 222 
under bias, 33 
built-in potential, 32 
current-voltage characteristics, 33-34 
depletion capacitance, 33 
depletion layer width, 32-33 
photodiode, 267 
Poisson’s equation, 54 
polycrystalline solids, 5 
poly-Si plug, 128 
port isolation, 400 
port matching, 325-326 
power amplifiers, 376-383 
class D amplifier, 379-381 
class E and F amplifiers, 381-383 
classes of operation, 376-377 
switched amplifiers, 377-379 
power gains 
for amplifier design, 298-301 
definitions, 301-302 
power output against current, 249 
powers at input and load, 300-301 
probability and uncertainty principle, 12-13 
propagation constant, 230 
pseudomorphic HEMT structure, 86-87 
pseudo-temperature, 165 
pulse-doped HEMT, 85-86 
pulse-doped MESFET, 53 


quadrature generation, 417-418 
quanta, 9 
quantum dot lasers, 255-261 
quantum mechanics, 12 
probability and uncertainty principle, 12-13 
wave equation, 13-15 
quantum well 
lasers, 229, 255-259, 261 
and quantum dot lasers, 258-261 


radial relaxation oscillation frequency, 250 
radiative recombination, 20 

coefficient, 223, 240 

rate, 222 
radio frequency MOSFETs, 95-96 

drain current, 100-101 


backgating, 102 
constant-mobility model, 101-102 


non-ideal effects in short-channel MOSFETs, 


102-104 
velocity saturation, 105-106 
large-signal modelling, 106-109 
MOS diode operation, 97-100 
small-signal model and RF performance, 
109-110 
maximum frequency of oscillation, 112 
microwave noise, 112-114 
transit frequency, 110—112 
structure, 96 
reciprocal mixing, 393 
reciprocity, 294 
recombination rate, 222—223, 241 
reference planes, 294-295 
relative intensity noise (RIN), 253 
relaxation frequency, 251 


relaxation oscillation frequency, 221, 249-251, 254, 


260-261 
resistive mixer, 398-404 
resonators, 383-385 
quality factor, 384-385 
ridge laser, 232-233, 237 
Rollet factor, 304, 305 


SA, see simulated annealing (SA) 


SAM, see separate absorption and multiplication 


(SAM) 
saturated regime, 51 
saturation, 122 
scattering parameter theory, 291-295 
Schottky diode, 34-35, 49 
Schrédinger wave equation, 14 
second medium index, 231 
second-order intermodulation, 396 
self-excitation criteria, 385-386 
semi-analytical device parameter extraction 
equivalent circuit elements, extraction of 
base—collector capacitance, 189-190 
base—collector resistance, 190-192 
base contact lead inductor, 193 
collector contact lead inductor, 190 
collector extrinsic resistance, 192-193 
emitter lead inductor and base-emitter 
capacitance, 195-196 


emitter resistance and base-emitter resistance, 


195 
intrinsic and extrinsic base resistances, 
193-195 
parasitic elements extraction of, 186-189 
transport factor, 196-198 
parameter extraction results, 198, 201-202 
bias-dependent parameters, 200-201 
bias-independent parameters, 199 
optimisation error, 199-200 


More free ebooks : http://fast-file.blogspot.com 


Building blocks for high-speed analogue circuits 


semiconductors 
band diagrams, 22 
carriers in, 18 
elemental and binary compound, 4 
in equilibrium 
electron and hole distribution, 19-20 
extrinsic, 18-19 
intrinsic, 18 
materials, 4 
in periodic table, 4 
recombination and radiation in, 25-26 
spontaneous and stimulated emission, 26-28 
ternary and quaternary, 4 
semiconductor heterostructures, central design 
principle of, 40 
semiconductor lasers, 229 
absorption, emission and gain, calculation of, 244 
classification, 231-232 
concepts of, 229-231 
confinement factor, 230-231 
emission characteristics of, 238 
carrier lifetime, 240 
gain coefficient, 239-240 
L-I curves, 242 
photon lifetime, 240-241 
high-speed lasers, 259-261 
laser noise, 253-255 
mirror reflectivity, 230 
optical waveguides in, 231—238 
quantum well and quantum dot lasers, 255-259 
rate equations, 244-248 
steady-state and dynamic characteristics, 248-253 
cavity photons, 252-253 
linewidth broadening factor, 252-253 
power output against current, 249 
radial relaxation oscillation frequency, 250, 
251 
separate absorption and multiplication (SAM), 275 
separate confinement (SC) region, 261 
series—series feedback, 324 
Shockley—Read—Hall recombination, 26 
short-base diode condition, 117 
shunt—shunt feedback, 324 
signal-to-noise ratio, 265, 271 
silicon—germanium heterostructures, 40-43 
simulated annealing (SA), 164 
hill-climbing capability of, 166 
modelling of HEMT, application of, 166-168 
pseudo-temperature, 165 
single-balanced mixer, 401-402 
single quantum well vertical cavity laser, 259 
single-sideband noise figure, 411 
single-tone excitation, 370-371 
Si/SiGe HBTs, 143-146 
III-V versus Si/SiGe HBTs, 146-147 
slab waveguide, 232 
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small-signal amplifiers, 324—325 
small-signal dynamic behaviour, 125-129 
maximum frequency of oscillation, 130-131 
transit frequency, 129-130 
small-signal equivalent circuit, 58-61 
versus large-signal model, 58 
maximum frequency of oscillation, 61 
transit frequency, 60-61 
small-signal model and RF performance, 109-110 
small-signal model of the collector-up (inverted) 
HBT, 212-214 
small-signal parameter extraction, expressions 
approximation at Rp; = 0, 212 
parameter extraction, 209-211 
Z-parameters 
for HBT, theoretical approximations, 203-208 
at zero bias, 208-209 
small-signal photons in cavity, 252-253 
Smith chart, 295-297 
solids, types of, 5 
source-coupled amplifier, 341-342 
source follower, 322 
space charge region, 30 
spontaneous emission, 26-28 
factor, 247 
spreading resistance, 113 
stability, 308-305 
circles, 303-305 
Rollet’s stability factor, 304-305 
unconditional, 303 
statistical mechanics 
Fermi—Dirac distribution, 16 
free electron, 16 
stimulated emission, 26-28 
stimulated transition, 27 
strong inversion, 98 
structured genetic algorithm (SGA) 
modelling of HEMT amplifier, application of, 182 
parameters for, 184 
for neural networks, 183 
simplified power amplifier schematic, 183 
supply layer, 75 
surface-emitting LED, 229 
surface recombination, 223 


thermal noise, see Johnson noise 

three-layer slab symmetric guide, 232, 237 

threshold current density, 240 

total cavity photons, 252 

transconductance multiplier, 397 

transistor amplifiers, 313-314 
broadband amplifier techniques, 350-354 
configurations, 314-323 
configurations with two transistors, 329-336 
differential amplifiers, 336-341 
distributed amplification, 354-365 
feedback, 323-326 
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transistor amplifiers (cont.) 
linearity, 370-376 
low-noise amplifier, 365-370 
power amplifiers, 376-383 
source-coupled amplifier, 341-342 
tuned amplifiers, 342-350 
transit frequency, 60-61, 110-112, 129-130 
transit time optimisation 
collector, 135-136 
drift field in base, 134-135 
transmission line, 382—383 
transport factor a, 196-198 
transverse electric field, 24 
trapezoidal Germanium profile, 145 
trapping effects, 83-85 
travelling wave p—i—n photodiodes 
with coplanar electrodes, 280 
equivalent circuit of, 282 
and p-i-n structure, 281-282 
velocity mismatch, 282-283 
waveguide photodetector (WPD), 280 
tuned amplifiers, 342—350 
input and output matching networks, 344-350 
resonant loads, 342-344 
tunnelling injection laser, 261 
two-dimensional electron gas (2DEG), 70 
two-port networks, basic relations for 
impedance matching, 297-298 
Mason’s unilateral gain, 305-306 
maximum available gain and maximum stable 
gain, 305 
maximum frequency of oscillation, 306 


power gains for amplifier design, 298-299 
scattering parameter theory, 291-295 
Smith chart, 295-303 
stability, 302-305 
two-tone excitation, 372-375 
two-tone second-order intermodulation products, 
373 
two-tone third-order intermodulation products, 373 


ultrafast lasers, 229 
under bias, 33 
unilateral gain, 305 
unilateralisation, 325 
unloaded Q, 385 


VBIC95 model, 150 

Vertical cavity surface emitting lasers (VCSELs), 
258, 259, 270 

virtual ground, 337 


wall plug efficiency, see external power efficiency 
wave equation, 13-15 

waveguide mode refractive index, 238 

waveguide photodetector (WPD), 280 

wide-gap emitter, 137-139 

WPD, see waveguide photodetector (WPD) 


zero-gap configuration, 39 

Z-parameters 
for HBT, theoretical approximations, 203-208 
at zero bias, 208-209 
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